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Trp8, Trp9 and TrplO, novel markers for cancer 



FIELD OF THE INVENTION 

The present invention relates to gene expression in normal cells and cells of malignant 
tumors and particularly to novel markers associated with cancer, Trp8, Trp9 and TrplO, and 
the genes encoding Trp8, Trp9 and TrplO 

BACKROUND OF THE TECHNOLOGY 

Prostate cancer is one of the most common diseases of older men world wide. Diagnosis and 
monitoring of prostate cancer is difficult because of the heterogeneity of the disease. For 
diagnosis different grades of malignancy can be distinguished according to the Gleason-Score 
Diagnosis. For this diagnosis a prostate tissue sample is taken from the patient by biopsy and 
the morphology of the tissue is investigated. However, this approach only yields subjective 
results depending on the experience of the pathologist. For confirmation of these results and 
for obtaining an early diagnosis an additional diagnostic method can be applied which is 
based on the detection of a prostate specific antigen (PSA). PSA is assayed in serum samples, 
blood samples etc. using an anti-PSA-antibody. However, since in principle PSA is also 
expressed in normal prostate tissue there is a requirement for the definition of a threshold 
value (about 4 ng/ml PSA) in order to be able to distinguish between normal and malign 
prostate tissue. Unfortunately, this diagnostic method is quite insensitive and often yields 
false-positive results. Moreover, by using this diagnostic method any conclusions as regards 
the grade of malignancy, the progression of the tumor and its potential for metastasizing 
cannot be drawn. Thus, the use of molecular markers would be helpful to distinguish benign 
from malign tissue and for grading and staging prostate carcinoma, particularly for patients 
with metastasizing prostate cancer having a very bad prognosis. 

The above discussed limitations and failings of the prior art to provide meaningful specific 
markers which correlate with the presence of prostate tumors, in particular metastasizing 
tumors, has created a need for markers which can be used diagnostically, prognostically and 
therapeutically over the course of this disease. The present invention fulfils such a need by the 
provision of Tpr8, Trp9 and TrplO and the genes encoding Trp8, Trp9 and TrplO: The genes 
encoding Trp8 and TrplO are expressed in prostate carcinoma and prostatic metastasis, but 
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not in normal prostate, benign hyperplasia (BHP) and intraepithelial prostatic neoplasia (PIN). 
Furthermore, expression of TrplO transcripts is detectable in carcinoma but not in healthy 
tissue of the lung, the prostate, the placenta and in melanoma. 

SUMMARY OF THE INVENTION 

The present invention is based on the isolation of genes encoding novel markers associated 
witha cancer, Trp8, Trp9 and TrplO. The new calcium channel proteins Trp8, Trp9 and TrplO 
are members of the trp (transient receptor potential) - family, isolated from human placenta 
(Trp8a and Trp8b) and humane prostate (Trp9, Trp 10a and Trp 10b). Trp proteins belong to a 
steadily growing family of Ca 2+ selective and non selective ion channels. In the recent years 
seven Trp proteins (trpl - trp7) have been identified and suggested to be involved in cation 
entry, receptor operated calcium entry and pheromone sensory signaling. Structurally related 
to the trp proteins are the vanilloid receptor (VR1) and the vanilloid like receptor (VRL-1) 
both involved in nociception triggered by heat. Furthermore, two calcium permeable channels 
were identified in rat small intestine (CaTl) and rabbit kidney (ECaC). These distantly related 
channels are suggested to be involved in the uptake of calcium ions from the lumen of the 
small intestine (CaTl) or in the reuptake of calcium ions in the distal tubule of the kidney 
(ECaC). Common features or the Trp and related channels are a proposed structure 
comprising six transmembrane domains including several conserved amino acid motifs. In the 
present invention the cloning and expression of a CaTl like calcium channel (Trp8) from 
human placenta as well as Trp9 and TrplO (two variants, TrplOa and TrplOb) is described. 
Two polymorphic variants of the Trp8 cDNA were isolated from placenta (TrpSa and Trp8b). 
Transient expression of the Trp8b cDNA in HEK (human embryonic kidney) cells results in 
cytosolic calcium overload implicating that the Trp8 channel is constitutive open in the 
expression system. Trp8 induces highly calcium selective inward currents in HEK cells. The 
C -terminus of the Trp8 protein binds calmodulin in a calcium dependent manner. The Trp9 
channel is expressed in trophoblasts and syncytiotrophoblasts of placenta and in pancreatic 
acinar cells. Furthermore, the Trp8 channel is expressed in prostatic carcinoma and prostatic 
metastases, but not in normal tissue of the prostate. No expression of Trp8 transcripts is 
detectable in benign prostatic hyperplasia (BPH) or prostatic intraepithelial neoplasia (PIN). 
Therefore, the Trp8 channel is exclusively expressed in malign prostatic tissues and serves as 
molecular marker for prostate cancer. From the experimental results it is also apparent that the 
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modulation of Trp8 and/or TrplO, e.g. the inhibition of expression or activity , is of 
therapeutic interest, e.g. for the prevention of tumor progression. 



The present invention, thus, provides a Trp8, Trp9 and TrplO protein, respectively, as well as 
nucleic acid molecule encoding the protein and, moreover, an antisense RNA, a ribozyme and 
an inhibitor, which allow to inhibit the expression or the activity of Trp8, Trp9 and/or TrplO. 

In one embodiment, the present invention provides a diagnostic method for detecting a 
prostate cancer or endometrial cancer (cancer of the uterus) associated with Trp8 or TrplO in 
a tissue of a subject, comprising contacting a sample containing Trp8 and/or TrplO encoding 
mRNA with a reagent which detects Trp8 and/or TrplO or the corresponding mRNA. 

In a further embodiment, the present invention provides a diagnostic method for detecting a 
melanoma, chorion carcinoma, cancer of the lung and of the prostate in a tissue of a subject, 
comprising contacting a sample with a reagent which detects TrplOa and/or TrplOb antisense 
transcripts or TrplOa and/or TrplOb related antisense transcripts. 

In another embodiment, the present invention provides a method of treating a prostate tumor, 
carcinoma of the lung, carcinoma of the placenta (chorion carcinoma) or melanoma associated 
with TrpS and/or TrplO, comprising administering to a subject with such an disorder a 
therapeutically effect amount of a reagent which modulates, e.g. inhibits, expression of Trp8 
and/or TrplO or the activity of the protein, e.g. the above described compounds. 

Finally, the present invention provides a method of gene therapy comprising introducing into 
cells of a subject an expression vector comprising a nucleotide sequence encoding the above 
mentioned antisense RNA or ribozyme, in operable linkage with a promoter. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1: A, phylogenetic relationship of trp and related proteins. B, hydropathy plot of the 
Trp8 protein sequence according to Kyte and Doolittle. C, alignment of Trp8a/b to the 
epithelial calcium channels ECaC (from rabbit) and Vrl (from rat). Putative transmembrane 
domains are underlined. 
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Figure 2: A, polymorphism of the Trp8 gene. The polymorphic variants Trp8a and TrpSb 
differ in five base pairs resulting in three amino acid exchanges in the derived protein 
sequences. Specific primers were derived from the Trp8 gene as indicated by arrows. B, the 
Trp8a and Trp8b genes are distinguishable by a single restriction site. Genomic fragments of 
the Trp8 gene can be amplified using specific primers (shown in A). The genomic fragment of 
the Trp8b gene contains an additional site of the restriction enzyme BSP 12861 (B). C, the 
Trp8 gene is located on chromosome 7. D, genotyping of eleven human subjects. A 458 bp 
genomic fragment of the Trp8 gene was amplified using specific primers (shown in A) and 
restricted with BSP 12861. The resulting fragments were analyzed by PAGE electrophoresis. 

Figure 3: The Trp8b protein is a calcium selective ion channel. A, representative trace of a 
pdiTrp8b transfected HEK 293 cell. TrpSb mediated currents are activated by voltage ramps 
(-100 mV - +100 mV) of 100 msec at -40 mV or +70 mV holding potential. 1, Trp8b currents 
in the presence at 2mm [Ca 2+ ] D ;2, effect of solution switch alone 3, switch to nominal zero 
calcium solution. B, TrpSb currents in the presence of zero divalent cations. C, current voltage 
relationship of the currents shown in A. Inset, leak subtracted current. D, current voltage 
relationship of the current shown in B. E, statistics of representative experiments. Black: Trp8 
transfected cells, gray: control cells. Columns from left to right: Trp8 currents at - 40 mV 
(n=12) and +70 mV holding potential (n=12). Trp8 currents in standard bath solution 
including 120 mM NMDG without sodium (n = 7) and with nominal zero calcium ions (n = 8) 
or in the presence of ImM EGTA with zero divalent cations (n = 6). F, representative changes 
in [Ca 2+ ]i in TrpSb transfected HEK cells (gray) and controls (black) in the presence or 
absence of ImM [Ca 2+ ] Q . Inset, relative increase of cytosolic calcium concentration of TrpSb 
transfected HEK cells, before and after readdition of 1 mM [Ca 2+ ] 0 in comparison to control 
cells. 

Figure 4 : The C-terminal region of the Trp8 protein binds calmodulin. A, N- and C-terminal 
fragments of the Trp8 protein used for calmodulin binding studies. B, the Trp8 protein and a 
truncated Trp8 protein which was in vitro translated after Muni cut of the cDNA, which lacks 
the C-terminal 32 amino acid residues, were in vitro translated in the presence of 35 S- 
methionine and incubated with calmodulin coupled agarose beads in the presence of 1 mM 
Ca 2+ or 2mM EGTA. C, calmodulin binding to N- and C-terminal fragments of the 
Trp8protein in the presence of Ca 2+ (1 mM) or EGTA (2 mM) 
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Figure 5: Expression pattern of the TrpS cDNA. A, Northern blots (left panels, Clontech, Palo 
Alto) were hybridized using a 348 bp NcoI/BamHT fragment of the Trp9 cDNA. The probe 
hybridizes to mRNA species isolated from the commercial blot, but not to mRNA species 
isolated from benign prostate hyperplasia (right panel, mRNA isolated from 20 human 
subjects with benign prostate hyperplasia). B,C, in situ hybridization with biotinylated TrpS 
specific oligonucleotides on slides of human tissues. Left column antisense probes, right 
column sense probes. D, antinsense probes. 

Figure 6: Differential expression of Trp8 cDNA in human prostate. A-F, in situ 
hybridization with prostatic tissues. A, normal prostate, B, primary carcinoma, C, benign 
hyperplasia, D, rezidive carcinoma, E, prostatic intraepithelial neoplasia, F, lymphnode 
metastasis of the prostata. 

Figure 7: Trp8a cDNA sequence and derived amino acid sequence 

Figure 8: A, Trp8b cDNA sequence and derived amino acid sequence 

B, cDNA sequence of splice variant 1 (12B1) 

C, cDNA sequence of splice variant 2 (17-3) 

D, cDNA sequence of splice variant 3 (23 A3) 

E, cDNA sequence of splice variant 4 (23 C3) 

Figure 9: A, Trp9 cDNA sequence and derived amino acid sequence B, cDNA sequence of 
splice variant 15 and derived amino acid sequence. 

Figure 10: A, cDNA sequence of TrplOa and derived amino acid sequence, B, cDNA 
fragment of TrplOa and derived amino acid sequence. 

Figure 1 1 : cDNA sequence of TrplOb and derived amino acid sequence. 
Figure 12: Expression of TrpS mRNA in human endometrial cancer or cancer of the uterus. A 
- D, in situ hybridization with slides of endometrial cancer hybridized with Trp8 antisense 
(left column) or sense probes as controls (right column). E - F, Trp8 antisense probes 
hybridized to slides of normal endometrium. It can be clearly seen no hybridization occurs 
with normal endometrial tissue. 
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Figure 13: Expression of human Trp9 and Tip 10 genes 

Northern blots were hybridized using Trp9 (upper panel) or TrplO (lower panel) specific 
probes. Expression of the Trp9 cDNA is detectable in many tissues including human prostate 
and colon as well as in benign prostatic hyperplasia. Expression of TrplO cDNA is detectable 
in human prostate of a commercial northern blot (Clontech, right side). This Northern blot 
contains prostatic tissue collected from 15 human subjects in the range of 14 - 60 years of age. 
No expression of TrplO cDNA was detectable in benign prostatic hyperplasia (left side). 

Figure 14 : Expression of TrplO transcripts and TrplO-antisense transcripts in human prostate 
cancer and metastasis of a melanoma. In situ hybridizations of slides hybridized with TrplO- 
antisense (A-E, K-N) and TrplO related sense probes (F-J, P-R). It can clearly be seen that 
both probes detect the same cancer cells indicating that these cancer cells express TrplO 
transcripts as well as TrplO-antisense transcripts. S, no TrplO expression is detectable in 
benign hyperplasia of the prostate (BPH). O and T, show expression of TrplO transcripts (O) 
and TrplO-antisense transcripts (T) in a metastasis of a melanoma in human lung. Melanoma 
cancer cells express both TrplO transcripts and TrplO-antisense transcripts. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to an isolated nucleic acid molecule encoding the human 
prostate carcinoma associated protein TrpSa, Trp8b, Trp9, TrplOa or TrplOb or a protein 
exhibiting biological properties of Trp8a, TrpSb, Trp9, TrplOa or TrplOb and being selected 
from the group consisting of 

(a) a nucleic acid molecule encoding a protein that comprises the amino acid sequence 
depicted in Figure 7, 8 A, 9,10 or 11; 

(b) a nucleic acid molecule comprising the nucleotide sequence depicted in Figure 7, 8A, 
9,10, or 11; 

(c) a nucleic acid molecule included in DSMZ Deposit no. DSM 13579 (deposit date: 28 
June 2000), DSM 13580 (deposit date: 28 June 2000), DSM 13584 (deposit date: 5 
July 2000), DSM 13581 (deposit date: 28 June 2000) or DSM ....(deposit date:....); 

(d) a nucleic acid molecule with hybridizes to a nucleic acid molecule specified in (a) to 
(c) 
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(e) a nucleic acid molecule the nucleic acid sequence of which deviates from the nucleic 
sequences specified in (a) to (d) due to the degeneration of the genetic code; and 

(f) a nucleic acid molecule, which represents a fragment, derivative or allelic variation of 
a nucleic acid sequence specified in (a) to (e). 

As used herein, a protein exhibiting biological properties of Trp8a, TrpSb, Trp9,Trpl0a or 
Trp 10b is understood to be a protein having at least one of the activities as illustrated in the 
Examples, below. 

As used herein, the term „isolated nucleic acid molecule,, includes nucleic acid molecules 
substantially free of other nucleic acids, proteins, lipids, carbohydrates or other materials with 
which it is naturally associated. 

In a first embodiment, the invention provides an isolated nucleic acid molecule encoding the 
human prostate carcinoma associated protein TrpSa, TrpSb, Trp9,Trpl0a or TrplOb 
comprising the amino acid sequence depicted in Figure 7, 8 A, 9,10 or 11. The present 
invention also provides a nucleic acid molecule comprising the nucleotide sequence depicted 
in Figure 7, 8A, 9,10 or 11. 



The present invention provides not only the generated nucleotide sequence identified in 
Figure 7, 8 A, 9,10 or 11, respectively and the predicted translated amino acid sequence, 
respectively, but also plasmid DNA containing a Trp8a cDNA deposited with the DSMZ, 
under DSM 13579, a Trp8b cDNA deposited with the DSMZ, under DSM 13580, a Trp9 
cDNA deposited with the DSMZ, under DSM 13584, a TrplOa cDNA deposited with the 
DSMZ, under DSM 13581, and a TrplOb cDNA deposited with the DSMZ, under DSM...., 
respectively. The nucleotide sequence of each deposited Trp-clone can readily be determined 
by sequencing the deposited clone in accordance with known methods. The predicted amino 
acid sequence can then be verified from such deposits. Moreover, the amino acid sequence of 
the protein encoded by each deposited clone can also be directly determined by peptide 
sequencing or by expressing the protein in a suitable host cell containing the deposited Trp- 
encoding DNA, collecting the protein, and determining its sequence. 

The nucleic acid molecules of the invention can be both DNA and RNA molecules. Suitable 
DNA molecules are, for example, genomic or cDNA molecules. It is understood that all 
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nucleic acid molecules encoding all or a portion of TrpSa, TrpSb, Trp9,Trpl0a or TrplOb are 
also included, as long as they encode a polypeptide with biological activity. The nucleic acid 
molecules of the invention an be isolated from natural sources or can be synthesized 
according to know methods. 

The present invention also provides nucleic acid molecules which hybridize to the above 
nucleic acid molecules. As used herein, the term „hybridize„ has the meaning of hybridization 
under conventional hybridization conditions, preferably under stringent conditions as 
described, for example, in Sambrook et al., Molecular Cloning, A Laboratory Manual 2 nd 
edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. Also 
contemplated are nucleic acid molecules that hybridize to the Trp nucleic acid molecules at 
lower stringency hybridization conditions. Changes in the stringency of hybridization and 
signal detection are primarily accomplished through the manipulation of formamide 
concentration (lower percentages of formamide result in lowered stringency), salt conditions, 
or temperature. For example, lower stringency conditions include an overnight incubation at 
37°Cin a solution comprising 6X SSPE (20X SSPE = 3M NaCl; 9.2M NaH 2 P0 4 ; 0.02M 
EDTA, pH7.4), 0.5% SDS, 30% formamide, 100 ng/ml salmon sperm blocking DNA, 
following by washes at 50°C with 1 X SSPE, 0.1% SDS. In addition, to achieve even lower 
stringency, washes performed following stringent hybridization can be done at higher salt 
concentrations (e.g. 5X SSC). Variations in the above conditions may be accomplished 
through the inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. The inclusion of specific blocking reagents may 
require modification of the hybridization conditions described above, due to problems with 
compatibility. 

Nucleic acid molecules that hybridize to the molecules of the invention can be isolated, e.g., 
from genomic or cDNA libraries that were produced from human cell lines or tissues. In order 
to identify and isolate such nucleic acid molecules the molecules of the invention or parts of 
these molecules or the reverse complements of these molecules can be used, for example by 
means of hybridization according to conventional methods (see, e.g., Sambrook et al, supra). 
As a hybridization probe nucleic acid molecules can be used, for example, that have exactly 
or basically the nucleotide sequence depicted in Figure 7, 8 A, 9,10 or 11, respectively, or 
parts of these sequences. The fragments used as hybridization probe can be synthetic 
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fragments that were produced by means of conventional synthetic methods and the sequence 
of which basically corresponds to the sequence of a nucleic acid molecule of the invention. 



The nucleic acid molecules of the present invention also include molecules with sequences 
that are degenerate as a result of the genetic code. 

In a further embodiment, the present invention provides nucleic acid molecules which 
comprise fragments, derivatives and allelic variants of the nucleic acid molecules described 
above encoding a protein of the invention. „Fragments„ are understood to be parts of the 
nucleic acid molecules that are long enough to encode one of the described proteins. These 
fragments comprise nucleic acid molecules specifically hybridizing to transcripts of the 
nucleic acid molecules of the invention. These nucleic acid molecules can be used, for 
example, as probes or primers in the diagnostic assay and/or kit described below and, 
preferably, are oligonucleotides having a length of at least 10, in particular of at least 15 and 
particularly preferred of at least 50 nucleotides. The nucleic acid molecules and 
oligonucleotides of the invention can also be used, for example, as primers for a PCR 
reaction. Examples of particular useful probes (primers) are shown in Tables 1 and 2. 

Table 1 

Trp8 probes used for in situ hybridization: 
Probes (antisense) 

1)5' TCCGCTGCCGGTTGAGATCTTGCC 3' 

2. ) 5' CTTGCTCCATAGGCAGAGAATTAG 3' 

3. ) 5' ATCCTC AGAGCCCCGGGTGTGGAA3 9 

Controls (sense) 

1. ) 5' GGCAAGATCTCAACCGGCAGCGGA 3' 

2. ) 5' CTAATTCTCTGCCTATGGAGCAAG 3' 

3. ) 5' TTCCACACCCGGGGCTCTGAGGAT 3' 

Tabelle 2 

TrplO probes used for the in situ hybridizations shown in Figure 14: 

9 



<WO 0210382A2_I_> 



WO 02/10382 



PCT/EP01/08309 



Probes (antisense) 

1. ) 5' GCTTCCACCCCAAGCTTCACAGGAATAGA 3 9 (Figure 14 A, 14B) 

2. ) 5' GGCGATGAAATGCTGGTCTGTGGC 3' (Figure 14C, 14D, 14N, 14S, 140) 

3. ) 5' ATCTTCCAGTTCTTGGTGTCTCGG 3' (Figure 14E, 14K) 

4. ) 5' GCTGCAGTACTCCTGCACCAGGAA 3' (Figure 14L, 14M) 

Probes (sense) 

1. ) 5' TCTATTCCTGTGAAGCTTGGGGTGGAAGC 3' (Figure 14F, 14G) 

2. ) 5' GCCACAGACCAGCATTTCATCGCC 3' (Figure 14H, 141, 14T) 

3. ) 5' CCGAGACACCAAGAACTGGAAGAT 3' (Figure 14J, 14P) 

4. ) 5' TTCCTGGTGCAGGAGTACTGCAGC 3' (Figure 14Q, 14R) 



The term ^derivative,, in this context means that the sequences of these molecules differ from 
the sequences of the nucleic acid molecules described above at one or several positions but 
have a high level of homology to these sequences. Homology hereby means a sequence 
identity of at least 40%, in particular an identity of at least 60%, preferably of more than 80% 
and particularly preferred of more than 90%. These proteins encoded by the nucleic acid 
molecules have a sequence identity to the amino acid sequence depicted in Figure 7, 8 A, 9, 10 
and 11, respectively, of at least 80%, preferably of 85% and particularly preferred of more 
than 90%, 97% and 99%. The deviations to the above-described nucleic acid molecules may 
have been produced by deletion, substitution, insertion or recombination. The definition of the 
derivatives also includes splice variants, e.g. the splice variants shown in Figures 8B to 8E 
and 9B. 

The nucleic acid molecules that are homologous to the above-described molecules and that 
represent derivatives of these molecules usually are variations of these molecules that 
represent modifications having the same biological function. They can be naturally occurring 
variations, for example sequences from other organisms, or mutations that can either occur 
naturally or that have been introduced by specific mutagenesis. Furthermore the variations can 
be synthetically produced sequences. The allelic variants can be either naturally occurring 
variants or synthetically produced variants or variants produced by recombinant DNA 
processes. 
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Generally, by means of conventional molecular biological processes it is possible (see, e.g., 
Sambrook et al., supra) to introduce different mutations into the nucleic acid molecules of the 
invention. As a result Trp proteins or Trp related proteins with possibly modified biological 
properties are synthesized. One possibility is the production of deletion mutants in which 
nucleic acid molecules are produced by continuous deletions from the 5'- or 3 '-terminal of the 
coding DNA sequence and that lead to the synthesis of proteins that are shortened 
accordingly. Another possibility is the introduction of single-point mutation at positions 
where a modification of the amino aid sequence influences, e.g., the ion channel properties or 
the regulations of the trp-ion channel. By this method muteins can be produced, for example, 
that possess a modified ion conducting pore, a modified Km-value or that are no longer subject 
to the regulation mechanisms that normally exist in the cell, e.g. with regard to allosteric 
regulation or covalent modification. Such muteins might also be valuable as therapeutically 
useful antagonists of Trp8a, Trp8b, Trp9,Trpl0a or Trp 10b, respectively. 

For the manipulation in prokaryotic cells by means of genetic engineering the nucleic acid 
molecules of the invention or parts of these molecules can be introduced into plasmids 
allowing a mutagenesis or a modification of a sequence by recombination of DNA sequences. 
By means of conventional methods (cf. Sambrook et al., supra) bases can be exchanged and 
natural or synthetic sequences can be added. In order to link the DNA fragments with each 
other adapters or linkers can be added to the fragments. Furthermore, manipulations can be 
performed that provide suitable cleavage sites or that remove superfluous DNA or cleavage 
sites. If insertions, deletions or substitutions are possible, in vitro mutagenesis, primer repair, 
restriction or ligation can be performed. As analysis method usually sequence analysis, 
restriction analysis and other biochemical or molecular biological methods are used. 

The proteins encoded by the various variants of the nucleic acid molecules of the invention 
show certain common characteristics, such as ion channel activity, molecular weight, 
immunological reactivity or conformation or physical properties like the electrophoretical 
mobilty, chromatographic behavior, sedimentation coefficients, solubility, spectroscopic 
properties, stability; pH optimum, temperature optimum. 

The invention furthermore relates to vectors containing the nucleic acid molecules of the 
invention. Preferably, they are plasmids, cosmids, viruses, bacteriophages and other vectors 
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usually used in the field of genetic engineering. Vectors suitable for use in the present 
invention include, but are not limited to the T7-based expression vector for expression in 
mammalian cells and baculovirus-derived vectors for expression in insect cells. Preferably, 
the nucleic acid molecule of the invention is operatively linked to the regulatory elements in 
the recombinant vector of the invention that guarantee the transcription and synthesis of an 
RNA in prokryotic and/or eukaryotic cells that can be translated. The nucleotide sequence to 
be transcribed can be operably linked to a promoter like a T7, metallothionein I or polyhedrin 
promoter. 

In a further embodiment, the present invention relates to recombinant host cells transiently or 
stable containing the nucleic acid molecules or vectors or the invention. A host cell is 
understood to be an organism that is capable to take up in vitro recombinant DNA and, if the 
case may be, to synthesize the proteins encoded by the nucleic acid molecules of the 
invention. Preferably, these cells are prokaryotic or eukaryotic cells, for example mammalian 
cells, bacterial cells, insect cells or yeast cells. The host cells of the invention are preferably 
characterized by the fact that the introduced nucleic acid molecule of the invention either is 
heterologous with regard to the transformed cell, i.e. that it does not naturally occur in these 
cells, or is localized at a place in the genome different from that of the corresponding 
naturally occurring sequence. 

A further embodiment of the invention relates to isolated proteins exhibiting biological 
properties of the human prostate carcinoma associated protein Trp8a, Trp8b, Trp9,Trpl0a or 
TrplOb and being encoded by the nucleic acid molecules of the invention, as weil as to 
methods for their production, whereby, e.g., a host cell of the invention is cultivated under 
conditions allowing the synthesis of the protein and the protein is subsequently isolated from 
the cultivated cells and/or the culture medium. Isolation and purification of the recombinantly 
produced proteins may be carried out by conventional means including preparative 
chromatography and affinity and immunological separations involving affinity with an anti- 
Trp8a-, anti-Trp8b-, anti-Trp9-,anti-Trpl0a- or anti-Trpl Ob-antibody, respectively. 

As used herein, the term „isolated protein,, includes proteins substantially free of other 
proteins, nucleic acids, lipids, carbohydrates or other materials with which it is naturally 
associated. Such proteins however not only comprise recombinantly produced proteins but 
include isolated naturally occurring proteins, synthetically produced proteins, or proteins 
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produced by a combination of these methods. Means for preparing such proteins are well 
understood in the art. The Trp proteins are preferably in a substantially purified form. A 
recombinantly produced version of a human prostate carcinoma associated protein Trp8a, 
Trp8b, Trp9,Trpl0a or Trp 10b protein, including the secreted protein, can be substantially 
purified by the one-step method described in Smith and Johnson, Gene 67; 3 1-40 (1988). 

In a further preferred embodiment, the present invention relates to an antisense RNA 
sequence characterised that it is complementary to an mRNA transcribed from a nucleic acid 
molecule of the present invention or a part thereof and can selectively bind to said mRNA, 
said sequence being capable of inhibiting the synthesis of the protein encoded by said nucleic 
acid molecules, and a ribozyme characterised in that it is complementary to an mRNA 
transcribed from a nucleic acid molecule of the present invention or a part thereof and can 
selectively bind to and cleave said mRNA, thus inhibiting the synthesis of the proteins 
encoded by said nucleic acid molecules. Riboyzmes which are composed of a single RNA 
chain are RNA enzymes, i.e. catalytic RNAs, which can intermolecularly cleave a target 
RNA, for example the mRNA transcribed from one of the Trp genes. It is now possible to 
construct ribozymes which are able to cleave the target RNA at a specific site by following 
the strategies described in the literature, (see, e.g., Tanner et al., in: Antisense Research and 
Applications, CRC Press Inc. (1993), 415-426). The two main requirements for such 
ribozymes are the catalytic domain and regions which are complementary to the target RNA 
and which allow them to bind to its substrate, which is a prerequisite for cleavage. Said 
complementary sequences, i.e., the antisense RNA or ribozyme, are useful for repression of 
TrpSa-, TrpSb, Trp9-,Trpl0a- and Trp 1 Ob-expression, respectively, i.e. in the case of the 
treatment of a prostate cancer or endometrial cancer (carcinoma of the uterus). Preferably, the 
antisense RNA and ribozyme of the invention are complementary to the coding region. The 
person skilled in the art provided with the sequences of the nucleic acid molecules of the 
present invention will be in a position to produce and utilise the above described antisense 
RNAs or ribozymes. The region of the antisense RNA and ribozyme, respectively, which 
shows complementarity to the mRNA transcribed from the nucleic acid molecules of the 
present invention preferably has a length of at least 10, in particular of at least 15 and 
particularly preferred of at least 50 nucleotides. 

In still a further embodiment, the present invention relates to inhibitors of TrpSa, TrpSb, Trp9, 
Trp 10a and Trp 10b, respectively, which fulfill a similar purpose as the antisense RNAs or 
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ribozymes mentioned above, i.e. reduction or elimination of biologically active Trp8a, TrpSb, 
Trp9, Tip 10a or Tip 10b molecules. Such inhibitors can be, for instance, structural analogues 
of the corresponding protein that act as antagonists. In addition, such inhibitors comprise 
molecules identified by the use of the recombinantly produced proteins, e.g. the 
recombinantly produces protein can be used to screen for and identify inhibitors, for example, 
by exploiting the capability of potential inhibitors to bind to the protein under appropriate 
conditions. The inhibitors can, for example, be identified by preparing a test mixture wherein 
the inhibitor candidate is incubated with Trp8a, TrpSb, Trp9, TrplOa or TrplOb, respectively, 
under appropriate conditions that allow Trp8a, TrpSb, Trp9, TrplOa or TrplOb to be in a 
native conformation. Such an in vitro test system can be established according to methods 
well known in the art. Inhibitors can be identified, for example, by first screening for either 
synthetic or naturally occurring molecules that bind to the recombinantly produced Tip 
protein and then, in a second step, by testing those selected molecules in cellular assays for 
inhibition of the Tip protein, as reflected by inhibition of at least one of the biological 
activities as described in the examples, below. Such screening for molecules that bind TrpSa, 
TrpSb, Trp9, TrplOa or TrplOb could easily performed on a large scale, e.g. by screening 
candidate molecules from libraries of synthetic and/or natural molecules. Such an inhibitor is, 
e.g., a synthetic organic chemical, a natural fermentation product, a substance extracted from 
a microorganism, plant or animal, or a peptide. Additional examples of inhibitors are specific 
antibodies, preferably monoclonal antibodies. Moreover, the nucleic sequences of the 
invention and the encoded proteins can be used to identify further factors involved in tumor 
development and progression. In this context it should be emphasized that the modulation of 
the calcium channel of a member of the trp family can result in the stimulation of the immune 
response of T lymphocytes leading to proliferation of the T lymphocytes. The proteins of the 
invention can, e.g., be used to identify further (unrelated) proteins which are associated with 
the tumor using screening methods based on protein/protein interactions, e.g. the two-hybrid- 
system Fields, S. and Song, O. (1989) Nature (340): 245-246. 

The present invention also provides a method for diagnosing a prostate carcinoma which 
comprises contacting a target sample suspected to contain the protein Trp8a, Trp8b, TrplOa 
and/or TrplOb or the TrpSa, Trp8b, TrplOa and/or TrplOb encoding mRNA with a reagent 
which reacts with Trp8a, TrpSb, TrplOa and/or TrplOb or the Trp8a, TrpSb, TrplOa and/or 
TrplOb encoding mRNA and detecting TrpSa, Trp8b, TrplOa and/or TrplOb or TrpSa, TrpSb, 
TrplOa and/or Trp 10b encoding mRNA. 
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It has been found that carcinoma cells of placenta (chorion carcinoma), lung and prostate 
express TrplO transcripts as well as TrplO antisense transcripts and transcripts being in part 
complementary to TrplO antisense transcripts. Accordingly, the present invention also 
provides a method for diagnosing a melanoma, chorion carcinoma, cancer of the lung and of 
the prostate in a tissue of a subject, comprising contacting a sample with a reagent which 
detects TrplOa and/or TrplOb antisense RNA. 

When the target is mRNA (or antisense RNA), the reagent is typically a nucleic acid probe or 
a primer for PCR. The person skilled in the art is in a position to design suitable nucleic acids 
probes based on the information as regards the nucleotide sequence of Trp8a, Trp8b, TrplOa 
or TrplOb as depicted in figure 7, 8a, 10 and 11, respectively, or tables 1 and 2, above. When 
the target is the protein, the reagent is typically an antibody probe. The term „antibody", 
preferably, relates to antibodies which consist essentially of pooled monoclonal antibodies 
with different epitopic specifities, as well as distinct monoclonal antibody preparations. 
Monoclonal antibodies are made from an antigen containing fragments of the proteins of the 
invention by methods well known to those skilled in the art (see, e.g., Kohler et al., Nature 
256 (1975), 495). As used herein, the term „antibody" (Ab) or „monoclonal antibody" (Mab) 
is meant to include intact molecules as well as antibody fragments (such as, for example, Fab 
and F(ab') 2 fragments) which are capable of specifically binding to protein. Fab and f(ab')2 
fragments lack the Fc fragment of intact antibody, clear more rapidly from the circulation, and 
may have less non-specific tissue binding than an intact antibody. (Wahl et al., J. Nucl. Med. 
24: 316-325 (1983)). Thus, these fragments are preferred, as well as the products of a FAB or 
other immunoglobulin expression library. Moreover, antibodies of the present invention 
include chimerical, single chain, and humanized antibodies. The target cellular component, 
i.e. Trp8a, Trp8b, TrplOa and/or TrplOb or TrpSa, TrpSb, TrplOa and/or TrplOb encoding 
mRNA or TrplOa/b antisense transcripts, e.g., in biological fluids or tissues, may be detected 
directly in situ, e.g. by in situ hybridization (e.g., according to the examples, below) or it may 
be isolated from other cell components by common methods known to those skilled in the art 
before contacting with a probe. Detection methods include Northern blot analysis, RNase 
protection, in situ methods, e.g. in situ hybridization, in vitro amplification methods (PCR, 
LCR, QRNA replicase or RNA-transcription/amplification (TAS, 3SR), reverse dot blot 
disclosed in EP-B1 O 237 362)), immunoassays, Western blot and other detection assays that 
are known to those skilled in the art. 
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Products obtained by in vitro amplification can be detected according to established methods, 
e.g. by separating the products on agarose gels and by subsequent staining with ethidium 
bromide. Alternatively, the amplified products can be detected by using labeled primers for 
amplification or labeled dNTPs. 

The probes can be detectable labeled, for example, with a radioisotope, a bioluminescent, 
compound, a chemiluminescent compound, a fluorescent compound, a metal chelate, or an 
enzyme. 

Expression of TrpSa, Trp8b, TrplOa and TrplOb, respectively, in tissues can be studied with 
classical immunohistological methods (Jalkanen et aL, J. Cell. Biol. 101 (1985), 976-985; 
Jalkanen et al., J. Cell. Biol. 105 (1987), 3087-3096; Sobol et al. Clin. Immunpathol. 24 
(1982), 139-144; Sobol et al., Cancer 65 (1985), 2005-2010). Other antibody based methods 
useful for detecting protein gene expression include immunoassays, such as the enzyme 
linked immunosorbent assay (ELISA) and the radioimmunoassay (RIA). Suitable antibody 
assay labels are known in the art and include enzyme labels, such as, glucose oxidase, and 
radioisotopes, such as iodine ( 125 I, 121 I), carbon ( 14 C), sulfur ( 35 S), tritium ( 3 H), indium ( 112 In), 
and technetium rhodamine, and biotin. In addition to assaying Trp8a, Trp8b, Trp 10a or- 
TrplOb levels in a biological sample, the protein can also be detected in vivo by imaging. 
Antibody labels or markers for in vivo imaging of protein include those detectable by X- 
radiography, NMR or ESR. For X-radiography, suitable labels include radioisotopes such as 
barium or cesium, which emit detectable radiation but are not overtly harmful to the subject. 
Suitable markers for NMR and ESR include those with a detectable characteristic spin, such 
as deuterium, which may be incorporated into the antibody by labeling of nutrients for the 
relevant hybridoma. A protein-specific antibody or antibody fragment which has been labeled 

13 1 1 12 

with an appropriate detectable imaging moiety, such as a radioisotope (for example, I, In, 
99 mTc), a radio-opaque substance, or a material detectable by nuclear magnetic resonance, is 
introduced (for example, parenterally, subcutaneously, or intraperitoneally) into the mammal. 
It will be understood in the art that the size of the subject and the imaging system used will 
determine the quantity of imaging moiety needed to produce diagnostic images. In the case of 
a radioisotope moiety, for a human subject, the quantity of radioactivity injected will normally 
range from about 5 to 20 millicuries of "mTc. The labeled antibody or antibody fragment will 
then preferentially accumulate at he location of cells which contain the specific protein. In 
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vivo tumor imaging is described in S.W. Burchiel et al., ,,Immunopharmacokinetics of 
Radiolabeled Antibodies and Their Fragments". (Chapter 13 in Tumor Imaging: The 
Radiochemical Detection of Cancer, S.W. Burchiel and B.A. Rhodes, eds., Masson 
Publishing Inc. (1982)). 

The marker Trp8a and TrpSb is also useful for prognosis, for monitoring the progression of 
the tumor and the diagnostic evaluation of the degree of malignancy of a prostate tumor 
(grading and staging), e.g. by using in situ hybridization: In a primary carcinoma Trp8 is 
expressed in about 2 to 10% of carcinoma cells, in a rezidive carcinoma in about 10 to 60% of 
cells and in metastases in about 60 to 90% of cells. 

The present invention also relates to a method for diagnosing endometrial cancer (cancer of 
the uterus) which comprises contacting a target sample suspected to contain the protein Trp8a 
and/or Trp8b or the Trp8a and/or Trp8b encoding mRNA with a reagent which reacts with 
Trp8a and/or Trp8b or the encoding mRNA and detecting Trp8a and/or Trp8b encoding 
mRNA. As regards particular embodiments of this method reference is made to the particular 
embodiments of the method of diagnosing a prostate cancer outlined above. 

For evaluating whether the concentration of Trp8a, Trp8b, TrplOa or Tip 10b or the 
concentration of Trp8a, TrpSb, TrplOa or TrplOb encoding mRNA is normal or increased, 
thus indicative for the presence of a malignant tumor, the measured concentration is compared 
with the concentration in a normal tissue, preferably by using the ratio of Trp8a:Trp9, 
Trp8b:Trp9 or Trpl0(a or b)/Trp9 for quantification. 

Since the prostate carcinoma forms its own basement membrane when growing invasively, it 
can be concluded that only cells expressing Trp8 and TrplO are involved in this phenomenon. 
Thus, it can be concluded that by inhibiting the expression and/or activity of these proteins an 
effective therapy of cancers like PCA is provided. 

Thus, the present invention also relates to a pharmaceutical composition containing a reagent 
which decreases or inhibits Trp8a, Trp8b, TrplOa and/or TrplOb expression or the activity of 
Trp8a, Trp8b, TrplOa and/or TrplOb, and a method for preventing, treating, or ameliorating a 
prostate tumor, endometrial cancer (uterine carcinoma) tumor, a chorion carcinoma, cancer of 
the lung or melanoma, which comprises administering to a mammalian subject a 
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therapeutically effective amount of a reagent which decreases or inhibits Trp8a, Trp8b, 
TrplOa and/or TrplOb expression or the activity of TrpSa, TrpSb, TrplOa and/or TrplOb. 
Examples of such reagents are the above described, antisense RNAs, ribozymes or inhibitors, 
e.g. specific antibodies. Furthermore, peptides, which inhibit or modulate the biological 
function of TrpSa, Trp8b, Trp9, TrplOa and/or TrplOb may be useful as therapeutical 
reagents. For example, these peptides can be obtained by screening combina torial phage 
display libraries (Cosmix, Braunschweig, Germany) as described by Rottgen, P. and Collins, 
J. (Gene (1995) 164 (2): 243-250). Furthermore, antigenic epitopes of the TrpS and TrplO 
proteins can be identified by the expression of recombinant Trp8 and TrplO epitope libraries 
in E. coli (Marquart, A. & Flockerzi, V., FEES Lett. 407 (1997), 137-140; Trost, C, et al., 
FEBS Lett. 451 (1999) 257-263 and the consecutive screening of these libraries with serum of 
patients with cancer of the prostate or of the endometrium. Those Trp8 and TrplO epitopes 
which are immunogenic and which lead to the formation of antibodies in the serum of the 
patients can be then be used as Trp8 or TrplO derived peptide vaccines for immune inventions 
against cancer cells which express TrpS or TrplO. Alternatively to the E. coli expression 
system, Trp8 or TrplO or epitopes of Trp8 and TrplO can be expressed in mammalian cell 
lines such as human embryonic kidney (Hek 293) cells (American Type Culture Collection, 
ATCC CRL 1573). 

Finally, compounds useful for therapy of the above described diseases comprise compounds 
which act as antagonists or agonists on the ion channels Trp8, Trp9 and TrplO. It could be 
shown that Trp8 is a highly calcium selective ion channel which in the presence of 
monovalent (namely sodium) and divalent ions (namely calcium) is only permeable for 
calcium ions (see Example 4, below, and Figures 3A, C, E). Under physiological conditions, 
TrpS is a calcium selective channel exhibiting large inward currents. This very large 
conductance of TrpS channels (as wells as Trp9 and TrplOato channels) is useful to establish 
systems for screening pharmacological compounds interacting with Trp-channels including 
high throughput screening systems. Useful high throughput screening systems are well known 
to the person skilled in the art and include, e.g., the use of cell lines stably or transiently 
transfected with DNA sequences encoding Trp8, Trp9 and TrplO channels in assays to detect 
calcium signaling in biological systems. Such systems include assays based on Ca-sensitive 
dyes such as aequorin, apoaequorin, Fura-2, Fluo-3 and Indo-1. 
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Accordingly, the present invention also relates to a method for identifying compounds which 
act as agonists or antagonists on the ion channels Trp8, Trp9 and/or TrplO, said method 
comprising contacting a test compound with the ion channel TrpS, Trp9 and/or TrplO, 
preferably by using a system based on cells stably or transiently transfected with DNA 
sequences encoding TrpS, Trp9 and/or TrplO, and determining whether said test compound 
affects the calcium uptake. 

For administration the above described reagents are preferably combined with suitable 
pharmaceutical carriers. Examples of suitable pharmaceutical carriers are well known in the 
art and include phosphate buffered saline solutions, water, emulsions, such as oil/water 
emulsions, various types of wetting agents, sterile solutions etc. Such carriers can be 
formulated by conventional methods and can be administered to the subject at a suitable dose. 
Administration of the suitable compositions may be effected by different ways, e.g. by 
intravenous, intraperetoneal, subcutaneous, intramuscular, topical or intradermal 
administration. The route of administration, of course, depends on the nature of the tumor and 
the kind of compound contained in the pharmaceutical composition. The dosage regimen will 
be determined by the attending physician and other clinical factors. As is well known in the 
medical arts, dosages for any one patient depends on many factors, including the patient's 
size, body surface area, age, sex, the particular compound to be administered, time and route 
of administration, the kind and stage of the tumor, general health and other drugs being 
administered concurrently. 

The delivery of the antisense RNAs or ribozymes of the invention can be achieved by direct 
application or, preferably, by using a recombinant expression vector such as a chimeric virus 
containing these compounds or a colloidal dispersion system. By delivering these nucleic 
acids to the desired target, the intracellular expression of Trp8a, Trp8b, TrplOa and/or TrplOb 
and, thus, the level of TrpSa, TrpSb, TrplOa and/or TrplOb can be decreased resulting in the 
inhibition of the negative effects of TrpSa, Trp8b, TrplOa and/or TrplOb, e.g. as regards the 
metastasis formation of PC A. 

Direct application to the target site can be performed, e.g., by ballistic delivery, as a colloidal 
dispersion system or by catheter to a site in artery. The colloidal dispersion systems which can 
be used for delivery of the above nucleic, acids include macromolecule complexes, 
nanocapsules, microspheres, beads and lipid-based systems including oil-in-water emulsions 
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(mixed), micelles, liposomes and Iipoplexes, The preferred colloidal system is a liposome. 
The composition of the liposome is usually a combination of phospholipids and steroids, 
especially cholesterol. The skilled person is in a position to select such liposomes which are 
suitable for the delivery of the desired nucleic acid molecule. Organ-specific or cell-specific 
liposomes can be used in order to achieve delivery only to the desired tumor. The targeting of 
liposomes can be carried out by the person skilled in the art by applying commonly known 
methods. This targeting includes passive targeting (utilizing the natural tendency of the 
liposomes to distribute to cells of the RES in organs which contain sinusoidal capillaries) or 
active targeting (for example by coupling the liposome to a specific ligand, e.g., an antibody, 
a receptor, sugar, glycolipid, protein etc., by well known methods). In the present invention 
monoclonal antibodies are preferably used to target liposomes to specific tumors via specific 
cell-surface ligands. 

Preferred recombinant vectors useful for gene therapy are viral vectors, e.g. adenovirus, 
herpes virus, vaccinia, or, more preferably, an RNA virus such as a Retrovirus. Even more 
preferably, the retroviral vector is a derivative of a murine or avian retrovirus. Examples of 
such retroviral vectors which can be used in the present invention are: Moloney murine 
leukemia virus (MoMuLV), Harvey murine sarcoma virus (HaMuSV), murine mammary 
tumor virus (MuMTV) and Rous sarcoma virus (RSV). Most preferably, a non-human 
primate retroviral vector is employed, such as the gibbon ape leukemia virus (GaLV), 
providing a broader host range compared to murine vectors. Since recombinant retroviruses 
are defective, assistance is required in order to produce infectious particles. Such assistance 
can be provided, e.g., by using helper cell lines that contain plasmids encoding all of the 
structural genes of the retrovirus under the control of regulatory sequences within the LTR. 
Suitable helper cell lines are well known to those skilled in the art. Said vectors can 
additionally contain a gene encoding a selectable marker so that the transduced cells can be 
identified. Moreover, the retroviral vectors can be modified in such a way that they become 
target specific. This can be achieved, e.g., by inserting a polynucleotide encoding a sugar, a 
glycolipid, or a protein, preferably an antibody. Those skilled in the art know additional 
methods for generating target specific vectors. Further suitable vectors and methods for in 
vitro- or in vivo-gene therapy are described in the literature and are known to the persons 
skilled in the art; see, e.g., WO 94/29469 or WO 97/00957. 
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In order to achieve expression only in the target organ, i.e. tumor to be treated, the nucleic 
acids encoding, e.g. an antisense RNA or ribozyme can also be operably linked to a tissue 
specific promoter and used for gene therapy. Such promoters are well known to those skilled 
in the art (see e.g. Zimmermann et al., (1994) Neuron 12, 1 1-24; Vidal et al.; (1990) EMBO J. 
9, 833-840; Mayford et al, (1995), Cell 81, 891-904; Pinkert et al., (1987) Genes & Dev. 1, 
268-76). 

For use in the diagnostic research discussed above, kits are also provided by the present 
invention. Such kits are useful for the detection of a target cellular component, which is 
TrpSa, TrpSb, TrplOa and/or TrplOb or, alternatively, Trp8a, Trp8b, TrplOa and/or TrplOb 
encoding mRNA or TrplOa/b antisense transcripts, wherein the presence or an increased 
concentration of Trp8a, TrpSb, TrplOa and/or TrplOb or, alternatively, Trp8a, TrpSb, TrplOa 
and/or TrplOb encoding mRNA or TrplOaft antisense transcripts is indicative for a prostate 
tumor, endometrial cancer, melanoma, chorion carcinoma or cancer of the lung, said kit 
comprising a probe for detection of TrpSa, Trp8b, Trp9, TrplOa and/or TrplOb or, 
alternatively, TrpSa, Trp8b, Trp9, TrplOa and/or TrplOb encoding mRNA or TrplOa/b 
antisense transcripts. The probe can be detectably labeled. Such probe may be a specific 
antibody or specific oligonucleotide. In a preferred embodiment, said kit contains an anti- 
Trp8a-, anti-Trp8b-, anti-Trp9-, anti-TrplOa-and/or anti-Trpl Ob-antibody and allows said 
diagnosis, e.g., by ELISA and contains the antibody bound to a solid support, for example, a 
polystyrene microliter dish or nitrocellulose paper, using techniques known in the art. 
Alternatively, said kits are based on a RIA and contain said antibody marked with a 
radioactive isotope. In a preferred embodiment of the kit of the invention the antibody is 
labeled with enzymes, fluorescent compounds, luminescent compounds, ferromagnetic probes 
or radioactive compounds. The kit of the invention may comprise one or more containers 
filled with, for example, one or more probes of the invention. Associated with container (s) of 
the kit can be a notice in the form prescribed by a governmental agency regulating the 
manufacture, use or sale of pharmaceuticals or biological products, which notice reflects 
approval by the agency of manufacture, us or sale for human administration. 



EXAMPLES 
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The following Examples are intended to illustrate, but not to limit the invention. While such 
Examples are typical of those that might be used, other methods known to those skilled in the 
art may alternatively be utilized. 

Example 1: Materials and Methods 

(A) Isolation of cDNA clones and Northern blot analysis 

Total RNA was isolated from human placenta an prostate using standard techniques. Isolation 
of mRNA was performed with poly (A) + RNA - spin columns (New England Biolabs, Beverly, 
USA) according to the instructions of the manufacturer. Poly (a) 4 KNA was reverse 
transcribed using the cDNA choice system (Gibco-BRL, Rockville, USA) and subcloned in A,- 
Zap phages (Stratagene, La Jolla, USA). An human expressed sequence tag (GenBank 
accession number 1404042) was used to screen an oligo d(T) primed human placenta cDNA 
library. Several cDNA clones were identified and isolated. Additional cDNA clones were 
isolated from two specifically primed cDNA libraries using primers 5'-gca tag gaa ggg aca 
ggt gg-3' and 5'-gag agt cga ggt cag tgg tcc-3\ 

cDNA clones were sequenced using a thermocycler (PE Applied Biosystems, USA) and 
Thermo Sequenase (Amersham Pharmacia Biotech Europe, Freiburg, Germany). DNA 
sequences were analyzed with an automated sequencer (Licor, Linccoln, USA). 

For Northern blot analysis 5 |ig human poly (A) + RNA from human placenta or prostate were 
separated by electrophoresis on 0.8 % agarose gels. Poly (A) + RNA was transferred to 
Hybond N nylon membranes (Amersham Pharmacia Biotech Europe, Freiburg, Germany). 
The membranes were hyridized in the presence of 50 % formamide at 42°C over night. DNA 
probes were labelled using [oc 32 P]dCTP and the „ready prime,, labelling kit (Amersham 
Pharmacia Biotech Europe, Freiburg, Germany). Commercial Northern blots were hybridized 
according to the distributors instructions (Clontech, Paolo Alto, USA). 

(B) Construction of expression plasmids and transfection of HEK293 cells 
Lipofections were carried out with the recombinant dicistronic eucaryotic expression plasmid 
pdiTRP8 containing the cDNA of TrpSb under the control of the chicken B-actin promotor 
followed by an internal ribosome entry side (IRES) and the cDNA of the green fluorescent 
protein (GFP). To obtain pdiTRP8 carrying the entire protein coding regions of TRP8b and 
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theGFP (Prasher, D.C. et al. (1992), Gene 111, 229-233), the 5 * and 3 '-untranslated sequences 
of the TRP8b cDNA were removed, the consensus sequence for initiation of translation in 
vertebrates (Kozak, M. (1987) Nucleic Acids Researchl5, 8125-8148) was introduced 
immediately 5 x of the translation initiation codon and the resulting cDNA was subcloned into 
the pCAGGS vector (Niwa, H., Yamamura, K. and Miyazaki, J (1991), Gene 8, 193-199) 
downstream of the chicken p-actin promotor. The IRES derived from encephalmyocarditis 
virus (Kim, D.G., Kang, H.M., Jang, S.K. and Shin H.S. (1992) Mol.CeU.Biol. 12, 3636- 
3643) followed by the GFP cDNA containing a Ser65Thr mutation (Heim, R, Cubitt, A.B., 
Tsien, R.Y. (1995) Nature 373, 663-664) was then cloned 3^ to the TRP8b cDNA. The IRES 
sequence allows the simultaneous translation of TRP8b and GFP from one transcript. Thus, 
transfected cells can be detected unequivocally by the development of green fluorescence. 

For monitoring of the intracellular Ca 2+ concentration human embryonic kidney (HEK 293) 
cells were cotransfected with the pcDNA3-TRP8b vector and the pcDNA3 -GFP vector in a 
molar ratio of 4 : 1 in the presence of lipofectamine (Quiagen, Hilden, Germany). To obtain 
pcDNA3-TRP8b the entire protein coding region of TRP8b including the consensus sequence 
for initiation of translation in vertebrates (Kozak, M. (1987) Nucleic Acids ResearchlS, 8125- 
8148) was subcloned into the pcDNA3 vector (Invitrogen, Groningen, Netherlands). Calcium 
monitoring and patch clamp experiments were carried out two days and one day after 
transfection, respectively. 

(C) Chromosomal localization of the Trp8 gene 

The chromosomal localization of the human TRP8 gene was performed using NIGMS 
somatic hybrid mapping panel No.2 (Coriell Institute, Camden, NJ, USA) previously 
described (Drwinga, H.L., Toji, L.H., Kim, C.H., Greene, A.E., Mulivor, R.A. (1993) 
Genomics 16, 311-314; Dubois, BX. andNaylor, S.L. (1993) Genomics 16, 315-319). 

(D) In Vitro Translation, glutathione - sepharose and calmodulin agarose binding assay 
N- and C-terminal Trp8-fragments were subcloned into the pGEX-4T2 vector (Amersham 
Pharmacia Europe, Freiburg, Germany) resulting in glutathione-S-transferase (GST)-TrpS 
fusion constructs (Fig. 4). The GST-TRP8-fusion proteins were expressed in E. coli BL 21 
cells and purified using glutathione - sepharose beads (Amersham Pharmacia Biotech Europe, 
Freiburg, Germany). 



23 



NO 0210382A2_I_> 



WO 02/10382 PCT/EPO 1/08309 

In vitro translation of human Trp8 cDNA and Xenopus laevis calmodulin cDNA (Davis, T.N. 
and Thorner, J. Proc.Natl.Acad.Sci. USA 86, 7909-7913.) was performed in the presence of 
35 S-methionine using the TNT coupled transcription/translation kit (Promega, Madison, 
USA). Translation products were purified by gel fiiltration (Sephadex G50, Amersham 
Pharmacia Biotech Europe, Freiburg, Germany) and equal amounts of 35 S labeled probes were 
incubated for 2 h with glutathione beads bound to GST - Trp8 or calmodulin - agarose 
(Calbiochem) in 50 mM Tris-HCl, pH 7.4, 0. 1 % Triton X-100, 150 mM NaCl in the presence 
of 1 mM Ca 2+ or 2 mM EGTA After three washes, bound proteins were eluted with SDS 
sample buffer, fractionated by SDS-PAGE and 35 S labeled proteins were detected using a 
Phosphor Imager (Fujifilm, Tokyo, Japan). 

(E) Calcium measurements 

The intracellular Ca 2+ concentration ([Ca 2+ ]0 was determined by dual wavelength fiira-2 
fluorescence ratio measurements (Tsien, R.Y. (1988) Trends NeuroscL 11, 419-424) using a 
digital imaging system (T.LL.L. Photonics, Planegg, Germany). HEK cells were grown in 
minimal essential medium in the presence of 10 % fetal calf serum and cotransfected with the 
pcDNA3-TRP8b vector and the pCDNA3-GFP vector as described above (B). Transfected 
cells were detected by development of green fluorescence. The cells were loaded with 4yM 
fura-2/AM (Molecular Probes, Oregon, USA) for one hour. After loading the cells were 
rinsed 3 times with buffer Bl (10 mM Hepes, 115 mM NaCl, 2 mM MgCl 2 , 5mM KC1, pH 
7.4) and the [Ca 2+ ]i was calculated from the fluorescence ratios obtained at 340 and 380 nm 
excitation wavelengths as described (Garcia, D.E., Cavalie, A. and Lux, H.D. (1994) J. 
Neurosci 14, 545-553). 

(F) Electrophysiological recordings 

HEK cells were transfected with the eucaryotic expression plasmid pdiTRP8 described in (B) 
and electrophysiolocigal recordings were carried out one day after transfection. Single cells 
were voltage clamped in the whole cell mode of the patch clamp technique as described 
(Hamill, O.P., Marty, A., Neher, E., Sakmann, B. and Sigworth, FJ. (1981) Pflugers Arch. 
391, 85-100; Philipp, S., Cavalie, A., Freichel, M., Wissenbach, U., Zimmer, S., Trost, C, 
Marquart, A., Murakami, M. and Flockerzi, V. (1996) EMBO J. 6166-6171). The pipette 
solution contained contained (mM): 140 aspartic acid, 10 EGTA, 10 NaCl, 1 MgCl2, 10 
Hepes (pH 7.2 with CsOH) or 125 CsCl, 10 EGTA, 4 CaCl 2 10 Hepes (pH 7,2 with CsOH). 
The bath solution contained (mM): 100 NaCl, 10 CsCl, 2 MgCl 2 , 50 mannitol, 10 glucose, 20 
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Hepes (pH 7,4 with CsOH) and 2 CaCl 2 , or no added CaCl 2 (-Ca 2+ solution). Divalent free 
bath solution contained (mM): 110 N-methyl-D-glucamine (NMDG). Whole cell currents 
were recorded during 100 msec voltage ramps from -100 to +100 mV at varying holding 
potentials. 

(G) In Situ Hybridization 

In situ hybridizations were carried out using formalin fixed tissue slices of 6 - 8 \iM thickness. 
The slices were hydrated and incubated in the presence of PBS buffer including 10 jig / ml 
proteinase K (Roche Diagnostics, Mannheim, Germany) for 0.5 h. The slices were hybridized 
at 37°C using biotinylated deoxy-oligonucleotides (0.5 pmol / fil) in the presence of 33 % 
formamide for 12 h. Furthermore the slices were several times rinsed with 2 x SSC and 
incubated at 25°C for 0.5 h with avidin / biotinylated horse raddish peroxidase complex 
(ABC, DAKO, Santa Barbara, USA). After several washes with PBS buffer the slices were 
incubated in the presence of biotinylated tyramid and peroxide (0.15 % w/v) for 10 min, 
rinsed with PBS buffer and additionally incubated with ABC complex for 0.5 h. The slices 
were washed with PBS buffer and incubated in the presence of DAB solution 
(diaminobenzidine (50^ig / ml), 50 mM Tris/EDTA buffer pH 8.4, 0.15 % H 2 0 2 in N,N - 
dimethyl-formamide; Merck, Darmstadt, Germany), The detection was stopped after 4 
minutes by incubating the slides in water. Tyramid was biotinylated by incubating NHS-LC 
Biotin (sulfosuccinimidyl-6-(biotinimid)-hexanoat), 2.5 mg / ml; Pierce, Rockford, USA) and 
tyramin-HCl (0.75 mg / ml, Sigma) in 25 mM borate buffer pH 8.5 for 12 h. The tyramid 
solution was diluted 1 - 5 : 1000 in PBS buffer. 

(H) GenBank accession numbers: TRP8a, Aj243500; TRP8b Aj243501 
Example 2: Expression of TKP8 transcripts 

In search of proteins distantly related to the TUP family of ion channels, an human expressed 
sequence tag (EST, GenBank accession number 1404042) was identified in the GenBank 
database using BLAST programms (at the National Center for Biotechnology Information 
(NCBI); Altschul, S.F., Gish, W., Miller, W., Myers, E.W. and Lipman, DJJ. (1990) Mol. 
Biol. 5, 403-410) being slightly homologous to the VR1 gene. Several human placenta cDNA 
libraries were constructed and screeened with this EST DNA as probe. Several full length 
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cDNA clones were identified and isolated. The full length cDNA clones encoded two putative 
proteins differing in three amino acids and were termed TrpSa and Trp8b (Fig. 1c, 2a, 7 and 
8A). This finding was reproduced by isolating cDNA clones from two cDNA libraries 
constructed from two individual placentas. The derived protein sequence(s) comprises six 
transmembrane domains, a characteristic overall feature of trp channels and related proteins 
(Fig.: lb). The sequence is closely related to the meanwhile published calcium uptake 
transport protein 1 (CaTl), isolated from rat intestine (Peng, J.B., Chen, X.Z., Berger, U.V., 
Vassilev, P.M., Tsukaguchi, H., Brown, E.M. and Hediger M.A.(1999) J Biol Chem. 6;274, 
22739-22746) and to the epithelial calcium uptake channel (ECaC) isolated from rabbit 
kidney (Hoenderop, J.G., van der Kemp, A.W., Hartog, A., van de Graaf, S.F., van Os, C.H., 
Willems, P.H. and Bindels, RJ. (1999) J Biol Chem. 26;274, 8375-8378). Expression of 
Trp8a^b transcripts are detectable in human placenta, pancreas and prostate (Fig.: 5) and the 
size of the Northern signal (3.0 kb) corresponds with the size of the isolated full length 
cDNAs. In addition, a shorter transcript of 1.8 kb, probably a splice variant, is detectable in 
human testis. The Trp8 mRNA is not expressed in small intestine or colon (Fig.: 5) 
implicating that Trp8 is not the human ortholog of the rat CaTl or rabbit ECaC proteins. To 
investigate whether there are other related sequences Trp8a/b derived primers (UW241, 5'- 
TAT GAG GGT TCA GAC TGC-3' and UW242, 5'-CAA AGT AGA TGA GGT TGC-3') 
were used to amplify a 105 bp fragment from human genomic DNA being 95% identical on 
the nucleotide level to the Trp8 sequence (data not shown). This indicates the existence of 
several similar sequences in humans at least at the genomic level. 

Example 3: Two variants of the Trp8 protein (Trp8a and Trp8b) arise by polymorphism 

Two variants of the Trp8 cDNA were isolated from human placenta (Fig.: 2A, 7 and 8 A) 
which encoded two proteins which differ in three amino acids and were termed TrpSa and 
Trp8b. Trp8aft specific primers were designed to amplify a DNA fragment of 458 bp of the 
Trp8 gene from genomic DNA isolated from human T-lymphocytes (primer pair: UW243, 5'- 
CAC CAT GTG CTG CAT CTA CC-3' and UW244, 5'-CAA TGA CAG TCA CCA GCT 
CC-3'). The amplification product contains a part of the sequence where the derived protein 
sequence of Trp8a comprises the amino acid valine and the Trp8b sequence methionine as 
well as a silent base pair exchange (g versus a) and an intron of 303 bp (Fig.: 2. A, B). Both 
variants of the TrpS genes (a,b) were amplified from genomic DNA in equal amounts 
indicating the existence of both variants in the human genome and therefore being not the 
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result of RNA editing (Fig.: 2B). The Trp8a gene can be distinguished from the Trp8b gene 
by cutting the genomic fragment of 458bp with the restriction enzyme Bspl286I (Fig. 2B). 
Using human genomic DNA isolated from blood of twelve human subjects as template, the 
458bp fragment was amplified and restricted with BSP1286I. In eleven of the tested subjects 
only the TrpSb gene is detectable, while one subject (7) contains TrpSa and Trp8b genes (Fig.: 
2D). These implicates that the two Trp8 variants arise by polymorphism and do not represent 
individual genes. Using TrpS specific primers and chromosomal DNA as template, the Trp8 
locus is detectable on chromosome 7 (Fig.: 2C). 

Example 4: Trp8b is a calcium permeable channel 

The protein coding sequence of the Trp8b cDNA was subcloned into pcDNA3 vector 
(Invitrogen, Groningen, Netherlands) under the control of the cytomegalovirus promotor 
(CMV). Human embryonic kidney (HEK 293) cells were cotransfected with the Trp8b 
pcDNA3 construct (pcDNA3-Trp8b vector) and the pcDNA3-GFPvector encoding the green 
fluorescent protein (GFP) in 4:1 ratio. The Trp8b cDNA and the cDNA of the reporter, GFP, 
was transiently expressed in human embryonic kidney (HEK 293) cells. The intracellular Ca 2+ 
concentration ([Ca 2+ ]i) and changes of [Ca 2+ ]i were determined by dual wavelength fura-2 
fluorescence ratio measurements (Fig.: 3F) in cotransfected cells which were identified by the 
green fluorescence of the reporter gene GFP. 

Dual wavelength fiira-2 fluorescence ratio measurement is a standard procedure (e.g. in: An 
introduction of Molecular Neurobiology (ed. Hall, Z.W.)Sinauer Associates, Sunderland, 
USA (1992)) using fura-2, which is a fluorescent Ca 2+ sensitive dye and which was designed 
by KY.Tsien (e.g. Trends Neurosci. 11, 419-424 (1988) based upon the structure of EGTA. 
Its fluorescence emission spectrum is altered by binding to Ca 2+ in the physiological 
concentration range. In the absence of Ca 2+ , fura-2 fluoresces most strongly at an excitation 
wavelength of 385 nm; when it binds Ca 2+ , the most effective excitation wavelength shifts to 
345 nm. This property is used to measure local Ca 2+ concentrations within cells. Cells can be 
loaded with fura-2 esters (e.g. fiira-2AM) that diffuse across cell membranes and are 
hydrolyzed to active fiira-2 by cytosolic esterases. 

In the presence of ImM Ca 2 * TrpS expressing cells typically contained more than 300 nM 
cytosolic Ca 2+ , while non transfected controls contained less than 100 nM Ca 2+ ions (Fig. 3F). 
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When TrpSb transfected cells were incubated without extracellular Ca 2+ , the intracellular Ca 2+ 
concentration ([Ca 2 " f ]i) decreased to levels comparable to non transfected cells. Readdition of 
ImM Ca 2+ to the bath resulted in significant increase of the cytosolic [Ca 2+ ] in Trp8b 
transfected cells, but not in controls (Fig.: 3F). After readdition of Ca 2+ ions to the bath 
solution, the cytosolic Ca 2+ concentration remains on a high steady state level in the Trp8b 
transfected cells. 

Example 5: Trp8 expressing cells show calcium selective inward currents 

To characterize in detail the electrophysiological properties of TRP8, TRP8 and GFP were 
coexpressed in HEK293 cells using the dicistronic expression vector pdiTRP8 and measured 
currents using the patch clamp technique in the whole cell mode (Hamill, O.P., Marty, A., 
Neher, E., Sakmann, B. and Sigworth, F.J. (19Sl)Pflugers Arch., 391, 85-100). 

The eucaryotic expression plasmid pdiTRPS contains the cDNA of Trp8b under the control of 
the chicken B-actin promotor followed by an internal ribosome entry side (IRES) and the 
cDNA of the green fluorescent protein (GFP). To obtain pdiTRP8 carrying the entire protein 
coding regions of TRP8b and the GFP (Prasher, DC. et al. (1992), Gene 111, 229-233), the 
5 s and 3 '-untranslated sequences of the TRP8b cDNA were removed, the consensus sequence 
for initiation of translation in vertebrates (Kozak, M. (1987) Nucleic Acids Researchl5, 8125- 
8148) was introduced immediately 5 x of the translation initiation codon and the resulting 
cDNA was subcloned into the pCAGGS vector (Niwa, H., Yamamura, K. and Miyazaki, J 
(1991), Gene 8, 193-199) downstream of the chicken p-actin promotor. The IRES derived 
from encephalmyocarditis virus (Kim, D.G., Kang, H.M., Jang, S.K. and Shin H.S. (1992) 
Mol.Cell.Biol. 12, 3636-3643) followed by the GFP cDNA containing a Ser65Thr mutation 
(Heim, R., Cubitt, A.B., Tsien, R.Y. (1995) Nature 373, 663-664) was then cloned 3^ to the 
TRP8b cDNA. The IRES sequence allows the simultaneous translation of TRP8b and GFP 
from one transcript. Thus, transfected cells can be detected unequivocally by the development 
of green fluorescence. 

In the presence of 2 mM external calcium, TrpSb transfected HEK cells show inwardly 
rectifying currents, the size of which depends on the level of intracellular calcium and the 
electrochemical driving force. The resting membrane potential was held either at -AO mV, or, 
to lower the driving force for calcium influx in between pulses, at + 70 mV. Current traces 
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were recorded in response to voltage ramps from -100 to +100 mV, that were applied every 
second. To monitor inward and outward currents over time, we analyzed the current size at - 
80 and + 80 mV of the ramps. Figure 3A shows a representative trace of the current at - 80 
mV over time. Both at a holding potential of -40 mV or at +70 mV, the currents are 
significantly larger than in cells transfected with only the GFP containing vector (Fig.: 3E). 
Interestingly, after changing to a positive holding potential, current size in Trp8 transfected 
cells slowly increases and reaches steady state after approximately 70 seconds (Fig.: 3 A). To 
determine the selectivity of the induced currents, we then perfused the cells with solutions that 
either contain no sodium, no added Ca 2+ (Fig. 3 A, C) or a sodium containing, but divalent ion 
free bath solution. To control for the effect of the solution change alone, we also perfused 
with normal bath (see puff in Fig. 3 A). While removal of external Ca 2+ completely abolishes 
the trp 8 induced currents - the remaining current being identical in size and shape to the 
control (Fig.: 3 A, C, E), removal of external sodium has no effect (Fig.: 3E). An important 
hallmark of calcium selective channels (e.g. Vennekens, R., Hoenderop, G.J., Prenen, J., 
Stuiover, M., Willems, PHGM, Droogmans, G., Nilius, B.and Bindels, RJ.M (1999) J. Biol. 
Chem. 275, 3963-3969), is their ability to conduct sodium only if all external divalent ions, 
namely Ca 2+ and magnesium are removed. To test whether the trp 8 channel conforms with 
this phenomenon normal bath solution was switched to a solution containing only sodium and 
1 mM EGTA. As can be seen in Figure 3B and D, Trp8 transfected cells can now conduct 
very large sodium currents. Interestingly, immediately after the solution change, the currents 
first become smaller before increasing rapidly, indicating that the pore may initially still be 
blocked by calcium a phenomenon usually called anomalous mole fraction behaviour 
(Warnat, J., Philipp, S., Zimmer, S.,.Flockerzi, V., and Cavalie A.(1999) J.Physiol. (Lond) 
518, 631-638). The measured outward currents of Trp8 transfected cells in normal bath 
solution are not significantly different from non-transfected control cells or cells which only 
express the reporter gene GFP. As the removal of external Ca 2+ abolishes the Trp8 specific 
current, the remaining current was subtracted from the current before the solution change to 
obtain the uncontaminated Trp8 conductance (see inset in Fig.: 3C). As expected from the 
given ionic conditions (high EGTA inside, 2 mM Ca 2+ outside), the current-voltage 
relationship now shows prominent inward rectification with little to no outward current. 

Both the time course of the development of Trp8 currents and the size of the currents depend 
on the frequency of stimulation (data not shown), the internal and external Ca 2+ concentration 

29 



) 0210382A2_I_> 



WO 02/10382 



PCT/EP01/08309 



and the resting membrane potential, suggesting that Trp8 calcium conductance is intrically 
regulated by a Ca 2+ mediated feedback mechanisms. 



Example 6: Ca / calmodulin binds to the C-terminus of the Trp8 protein 

To test whether calmodulin, a prime mediator of calcium regulated feedback, is involved, first 
it was investigated biochemically whether TrpS protein can bind calmodulin. Trp8 cDNA was 
in vitro translated in the presence of 35 S-methionine and the product incubated with 
calmodulin-agarose beads. After several washes either in the presence or abscence of Ca 2+ , 
the beads were incubated in Laemmli buffer and subjected to SDS-polyacrylamide gel 
electrophoresis. In the presence of Ca 2+ (ImM), but not in the absence of Ca 2+ , Trp8 protein 
binds to calmodulin (Fig.: 4B). 

To narrow down the binding site, two approaches were undertaken: Firstly, GST-TKP8 
fusion proteins of various intracellular domains of TrpS were constructed, expressed in E. coli 
and bound to gluthathione sepharose beads. These beads were then incubated with in vitro 
translated 35 S- labeled calmodulin, washed and subjected to gel electrophoresis. Secondly, 
truncated versions of in vitro translated Trp8 protein were used in the above described binding 
to calmodulin-agarose. As shown in Figure 4A, and C, fusion proteins of the N-terminal 
region (Nl, N2) of Trp8 did not bind calmodulin, while C-terminal fragments (CI, C2, C3, 
C4) showed calmodulin binding in the presence of calcium (for localization of fragments 
within the entire TrpS protein see Fig. 4C). Accordingly, a truncated version of in vitro 
translated Trp8, which lacks the C-terminal 32 amino acid residues did not bind to 
calmodulin-agarose (4B). We have restricted the calmodulin binding site to amino acid 
residues 691 to 71 1 of the TrpS protein. This calmodulin binding site does not resemble the 
typical conserved IQ - motif of conventional myosins, but has limited sequence homology to 
the calcium dependent calmodulin binding site 1 of the transient receptor potential like (trpl) 
protein of Drosophila melanogaster (Warr and Kelly, 1996) with several charged amino acid 
residues conserved. The sequence of the calmodulin binding site of the Trp8 protein 
resembles a putative amphipathic a-helical wheel structure with a charged and a hydrophobic 
site according to a model proposed by Erickson-Vitanen and De Grado (1987, Methods 
Enzymol. 139, 455-478.). 
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Example 7: Expression of Trp8 transcripts in human placenta and pancreas 



Several slides from a human placenta of a ten week old abort were used for in situ 
hybridization experiments. The in situ hybridization experiments revealed expression of TrpS 
transcripts in human placenta (Fig.: 5B). Expression was detectable in trophoblasts and 
syncytiotrophoblasts of the placenta, but not in Langhans cells. 

TrpS transcripts are detectable in human pancreas (Fig.: 5 A). Therefore Trp8 probes were 
hybridized to tissue sections of human pancreas. The pancreatic tissues were removed from 
patients with pancreas cancer. TrpS expression is detectable in pancreatic acinar cells, but not 
in Langerhans islets (Fig.: 5C). No Trp8 expression was found in regions of pancreatic 
carcinomas (data not shown). 

Furthermore, the Trp8 cDNA is not detectable in human colon nor in human kidney by in situ 
hybridization as well as by Northern analysis (Fig.: 5 A, D). The Northern results taken 
together with the in situ expression data indicate that the Trp8 protein is not the human 
ortholog of the CaTl and ECaC channels cloned from rat intestine (Peng, J.B., Chen, X.Z., 
Berger, U.V., Vassilev, P.M., Tsukaguchi, H., Brown, KM. and Hediger M.A(1999) J Biol 
Chem. 6;274, 22739-22746) and from rabbit kidney (Hoenderop, J.G., van der Kemp, A.W., 
Hartog, A., van de Graaf, S.F., van Os, C.H., Willems, P.H. and Bindels, R.J. (1999) J Biol 
Chem. 26;274, 8375-8378), respectively. Trp8 is unlikely to represent the human version of 
CaTl as its expression is undetectable in the small intestine and colon tissues where CaTl is 
abundantly expressed. If, however, TrpS is the human version of rat CaTl, a second gene 
product appears to be required for Ca 2+ uptake in human small intestine and colon attributed 
to CaTl in rat small intestine and colon. 

Example 8: Differential expression of TrpS transcripts in benign and malign tissue of 
the prostate 

The TrpS transcripts are expressed in human prostate as shown by hybridization of a Trp8 
probe to a commercial Northern blot (Clontech, Palo Alto, USA) (Fig.: 5 A). Trp8 transcripts 
were not detectable by Northern blot analysis using pooled mRNA of patients with benign 
prostatic hyperplasia (BPH) (Fig.: 5 A, prostate*). To examine Trp8 expression on the cellular 
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level, sections of prostate tissues were hybridized using Trp8 specific cDNA probes (Table 3). 
Expression of Trp8 transcripts is not detectable in normal prostate (n = 3), benign hyperplasia 
(BPH, n = 15) or prostatic intraepithelial neoplasia (PIN, n = 9) (Fig.: 6 A, C, E). Trp8 
transcripts were only detectable in prostate carcinoma (PCA), although with different 
expression levels. Low expression levels were found in primary carcinomas (2 - 10 % of the 
carcinoma cells, n = 8) (Fig.: 7B) . Much stronger expression was detectable in rezidive 
carcinoma (10-60 %) (Fig.: 7D, n = 6) and metastases of the prostate (60 - 90 %, n = 4) 
(Fig.: 7F). Thus it has to be concluded that the commercial Northern blot used in Fig.: 5 A 
contains not only normal prostate mRNA as indicated by the distributor. According to the 
distributors instructions the prostate mRNA used for this Northern blot was collected from 15 
human subjects in the range of 14 to 60 years of age. This prostate tissue was not examined 
by pathologic means. Since Trp8 expression is not detectable in normal or benign prostate, 
this finding implicates that the mRNA used for this Northern blot was extracted in part from 
prostatic carcinoma tissue. To summarize, TrpS expression is only detectable in malign 
prostate and, thus, the Trp8 cDNA is a marker for prostate carcinoma. The results are 
summarized in Table 4. 

Table 3 

TrpS probes used for in situ hybridization: 
Probes (antisense) 

1. ) 5' TCCGCTGCCGGTTGAGATCTTGCC 3' 

2. ) 5' CTTGCTCCATAGGCAGAGAATTAG 3' 

3. ) 5' ATCCTC AGAGCCCCGGGTGTGGAA3 ' 

Controls (sense) 

1. ) 5 5 GGCAAGATCTCAACCGGCAGCGGA 3 s 

2. ) 5' CTAATTCTCTGCCTATGGAGCAAG V 

3. ) 5' TTCCACACCCGGGGCTCTGAGGAT 3* 



Table 4 

Prostate total negative positive 

normal 3 3 0 

BPH 15 15 0 

PIN 9 9 0 
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(B) Differential expression of Trp8 transcripts in benign and malign tissue of the uterus 

Moreover it could be shown that Trp8 is expresed in endometrial cancer (also called cancer of 
the uterus, to be distinguished from uterine sarcoma or cancer of the cervix) whereas no 
expression was observed in normal uterus tissue. Thus, Trp8 also is a specific marker for the 
diagnosis of the above cancer (Fig. 12). 

Example 9: Characterization of Trp9 

The complete protein coding sequence of Trp9 was determined (Fig. 9). Trp 9 transcripts are 
predominantly expressed in the human prostate and in human colon. As it could be shown by 
Northern blot analysis, there is no difference of the expression of TRP9 in benigne prostata 
hyperplasia (BPH, Fig. 13, upper panel left) or prostate carcinoma (Fig. 13, upper panel 
right). However, Trp9 is useful as a reference marker for prostata carcinoma, i.e. can be used 
for quantifying the expression level of Trp8. The ratio of the expression of Trp8:Trp9 in 
patients and healthy individuals is useful for the development of a quantitative assay. 

Example 10: Characterization of TrplO 

The complete protein coding sequence of TRP 10 (a and b) was determined by biocomputing 
(Fig. 10 and 11). Using a 235 bp fragment of the TrplO cDNA as probe in Northern blot 
analysis TRP10 transcripts could only be detected in rnKNA isolated from individuals with 
prostate cancer (Fig. 13, bottom panel) but not in mRNA isolated from benign tissue of the 
prostate (prostate BPH) nor in mRNA isolated from heart, brain, placenta, lung, liver, skeletal 
muscle, kidney and pancreas. The 235 bp cDNA fragment of the TrplO cDNA was amplified 
using the primer pair UW248 5'-ACA GCT GCT GGT CTA TTC C-3' and UW249 5'-TAT 
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GTG CCT TGG TTT GTA CC-3' and prostate cDNA as template. In summary, TrplOa and 
TrplOb , like TRP8 are also expressed in malignant prostate tissue. So far, its expression 
could not be observed in any other tissue examined (see above). Thus, Trp 10a and TrplOb 
are also useful markers which are specific for malignant prostate tissue. 

Furthermore, database searches in public databases of the national center for biological 
information (NCBI) revealed the existence of several expressed sequence tags (EST clones) 
being in part identical to the Trp 10 sequence. These EST clones were originally isolated from 
cancer tissues of lung, placenta, prostate and from melanoma. These clones include the 
clones with the following accession numbers: BE274448, BE408880, BE207083, BE791173, 
AI671853, BE3 90627. The results demonstrate that cancer cells of these tissues express 
TrplO related transcripts whereas no expression of TrplO transcripts in the corresponding 
healthy tissues are detectable (Figure 13). Furthermore, it could be shown that in cancer cells 
of melanoma and prostate cancer TrplO transcripts are expressed as shown by in situ 
hybridizations using 4 antisense probes (Figure 14A - E and 13K-0 and Table 2, above). 
Furthermore, it could clearly be shown that cancer cells of these tissues expressing TrplO 
transcripts also express TrplO-antisense transcripts as shown in Figure 14F-J, Figure 14P-R 
and Figurel4T by in situ hybridizations using 4 sense probes (Table 2, above). The in situ 
hybridization experiments demonstrate that detection of a subset of cancer cells derived from 
carcinoma of lung, placenta, prostate and melanoma is feasible using antisense as well as 
sense probes complementary to TrplO transcripts or complementary to TrplO-antisense 
transcripts, respectively. 

The foregoing is meant to illustrate but not to limit the scope of the invention. The person 
skilled in the art can readily envision and produce further embodiment, based on the above 
teachings, without undue experimentation. 
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What Is claimed Is: 

1. An isolated nucleic acid molecule encoding the human prostate carcinoma associated 
protein Trp8a, Trp8b, Trp9, Tip 10a or Tip 10b or a protein exhibiting biological properties 
of TrpSa, Trp8b, Trp9, Tip 10a or TrplOb and being selected from the group consisting of 

(a) a nucleic acid molecule encoding a protein that comprises the amino acid sequence 
depicted in Figure 7, 8A, 9, 10 or 1 1; 

(b) a nucleic acid molecule comprising the nucleotide sequence depicted in Figure 7, 8A, 
9, 10 or 11; 

(c) a nucleic acid molecule included in DSMZ Deposit No. DSM 13579, DSM 13580, DSM 
13584 , DSM 13581 or DSM....; 

(d) a nucleic acid molecule which hybridizes to a nucleic acid molecule specified in (a) to (c); 

(e) a nucleic acid molecule the nucleic acid sequence of which deviates from the nucleic 
sequences specified in (a) to (d) due to the degeneration of the genetic code; and 

(f) a nucleic acid molecule, which represents a fragment, derivative or allelic variation of a 
nucleic acid sequence specified in (a) to (e). 

2. A recombinant vector containing the nucleic acid molecule of claim 1 

3. The recombinant vector of claim 2 wherein the nucleic acid molecule is operatively linked 
to regulatory elements allowing transcription and synthesis of a translatable RNA in 
prokaryotic and/or eukaryotic host cells. 

4. A recombinant host cell which contains the recombinant vector of claim 3. 

5. The recombinant host cell of claim 4, which is a mammalian cell, a bacterial cell, an insect 
cell or a yeast cell. 

6. An isolated protein exhibiting biological properties of the human prostate carcinoma 
associated protein Trp8a, Trp8b, Trp9, TrplOa or TrplOb which is encoded by a nucleic acid 
molecule of claim 1 . 

7. A recombinant host cell that expresses the isolated protein of claim 6. 
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8. A method of making an isolated protein exhibiting biological properties of the human 
prostate carcinoma associated protein Trp8a, TrpSb, Trp9, Trp 10a or TrplOb comprising: 

(a) culturing the recombinant host cell of claim 6 under conditions such that said protein is 
expressed; and 

(b) recovering said protein. 

9. The protein produced by the method of claim 8. 

10. An antisense RNA sequence characterized in that it is complementary to an mRNA 
transcribed from a nucleic acid molecule of claim 1 or a part thereof and can selectively bind 
to said mRNA or part thereof, said sequence being capable of inhibiting the synthesis of the 
protein encoded by said nucleic acid molecule. 

11. A ribozyme characterized in that it is complementary to an mRNA transcribed from a 
nucleic acid molecule of claim 1 or a part thereof and can selectively bind to and cleave said 
mRNA or part thereof thus inhibiting the synthesis of the protein encoded by said nucleic 
acid molecule. 

12. An inhibitor characterized in that it can suppress the activity of the protein of claim 6. 

13. A method for diagnosing a prostate carcinoma which comprises contacting a target 
sample suspected to contain the protein Trp8a, Trp8b, TrplOa and/or TrplOb or the Trp8a, 
TrpSb, TrplOa and/or TrplOb encoding mRNA with a reagent which reacts with TrpSa, 
TrpSb, TrplOa and/or TrplOb or the Trp8a, Trp8b, TrplOa and/or TrplOb encoding mRNA 
and detecting Trp8a, Trp8b, TrplOa and/or TrplOb or the Trp8a, Trp8b, TrplOa and/or 
TrplOb encoding mRNA. 

14. The method of claim 13, wherein the reagent is a nucleic acid. 

15. The method of claim 13, wherein the reagent is an antibody. 

16. The method of claim 13, wherein the reagent is detectably labeled. 
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17. The method of claim 16, wherein the label is selected from the group consisting of a 
radioisotope, a bioluminescent compound, a chemiluminescent compound, a fluorescent 
compound, a metal chelate, or an enzyme. 

18. A method for diagnosing an endometrial cancer (carcinoma of the uterus) which 
comprises contacting a target sample suspected to contain the protein TrpSa and/or Trp8b or 
the Trp8a and/or TrpSb encoding mRNA with a reagent which reacts with Trp8a and/or TrpSb 
or the TrpSa and/or Trp8a and/or trp8b encoding mRNA and detecting Trp8a and/or TrpSb or 
the Trp8a and/or Trp8b encoding mRNA. 

19. The method of claim 18, wherein the reagent is a nucleic acid. 

20. The method of claim 18, wherein the reagent is an antibody. 

21. The method of claim 18, wherein the reagent is detectably labeled. 

22. The method of claim 21, wherein the label is selected from the group consisting of a 
radioisotope, a bioluminescent compound, a chemiluminescent compound, a fluorescent 
compound, a metal chelate, or an enzyme. 

23. A method for diagnosing a melanoma, chorion carcinoma, cancer of the lung and of the 
prostate in a tissue of a subject, comprising contacting a sample with a reagent which detects 
TrplOa and/or TrplOb antisense RNA or TrplOa and/or TrplOb related antisense RNA. 

24. A method for preventing, treating, or ameliorating a prostate tumor, endometrial cancer 
(carcinoma of the uterus) tumor, a chorion carcinoma, cancer of the lung or melanoma, which 
comprises administering to a mammalian subject a therapeutically effective amount of a 
reagent which decreases or inhibits expression of Trp8a, TrpSb, TrplOa and/or TrplOb and/or 
the activity of TrpSa, Trp8b, TrplOa and/or TrplOb. 

25. The method of claim 24, wherein the reagent is a nucleotide sequence comprising an 
antisense RNA 
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26. The method of claim 24, wherein the reagent is a nucleotide sequence comprising a 
ribozyme. 



27. The method of claim 24, wherein the reagent is an inhibitor of Trp8a, Trp8b, Tip 10a 
and/or TrplOb. 

28. The method of claim 27, wherein the reagent is an anti-Trp8a-, anti Trp8b-, anti-Tip 10a- 
and/or anti-Tip 10b antibody or a fragment thereof. 

29. A diagnostic kit useful for the detection of TrpSa, TrpSb, TrplOa and/or TrplOb or TrpSa, 
Trp8b, TrplOa and/or TrplOb encoding mRNA or TrplOa and/or TrplOb antisense transcripts 
in a sample, wherein the presence of an increased concentration of Trp8a, Trp8b, Trp9, 
TrplOa and/or TrplOb or Trp8a, Trp8b, Trp9, TrplOa and/or TrplOb encoding mRNA or 
TrplOa and/or TrplOb antisense transcripts is indicative for a prostate tumor, endometrial 
cancer (cancer of the uterus) tumor, a chorion carcinoma, cancer of the lung or melanoma, 
said kit comprising a probe for detection of Trp8a, Trp8b, Trp9, TrplOa or TrplOb or TrpSa, 
Trp8b, Trp9, TrplOa and/or TrplOb encoding mRNA or TrplOa and/or TrplOb antisense 
transcripts. 

30. The kit of claim 29, wherein the target component to be detected is Trp8a, Trp8b, Trp9, 
TrplOa and/or TrplOb and the probe is an antibody. 

31. A method for identifying a compound which acts as an agonist or antagonist on the ion 
channels Trp8, Trp9 and/or Tip 10, said method comprising contacting a test compound 
with the ion channel Trp8, Trp9 and/or Trpl0,and determining whether said test compound 
affects the calcium uptake. 
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10 30 50 

GCCAAGTGTAACAAACTCACAGCCCTCTCCAAACTGGCTGGGGCTGCTGGGAGACTCCCA 

70 90 110 

AGGAACTCGTC AG G AAGGC AGG AGAC AG G AG ACGGG ACCT CT ACAGGGAG ACGGTGGGCC 

130 150 170 

GGCCCTTGGGGGGGCTGATGTGGCCCCAAGGCTGAGTCCCGTCAGGGTCTGGCCTCGGCC 

190 210 230 

TCAGGCCCCCAAGGAGCCGGCCCTACACCCCATGGGTTTGTCACTGCCCAAGGAGAAAGG 

MGISLPKEKG 
250 270 290 

GCTAATTCTCTGCCTATGGAGCAAGTTCTGCAGATGGTTCCAGAGACGGGAGTCCTGGGC 
LI LC LWS KFCRWFQRRE SWA 
310 330 350 

CCAGAG C CGAG ATG AGC AGAACCTGCTGC AGC AGAAGAGG ATCTGGGAGT CT CCTCT CCT 
QSRDEQNLLQQKRIWESPLL 
370 390 410 

TCTAGCTGC C AAAG ATAAT G ATGT CCAGGCCCT G AACAAGTTGCT C AAGT AT G AGGATTG 
LAAKDN DVQALNKLLKYED C 
430 450 470 

CAAGGT G CAC C AG AGAGG AG CCAT GGGGGAAAC AGCGCTACACAT AGCAG CCCTCT AT GA 
KVHQRGAMGETALHIAALYD 
490 510 530 

CAACCTGGAGGCCGCCATGGTGCTGATGGAGGCTGCCCCGGAGCTGGTCTTTGAGCCCAT 
NLEAAMVLMEAAPELVFEP M 
550 570 590 

GACATCTGAGCTCTATGAGGGTCAGACTGCACTGCACATCGCTGTTGTGAACmGAACAT 
TSELYEGQTA1HIAVVNQNM 
610 630 650 

GAACCTGGTGCGAGCCCTGCTTGCCCGCAGGGCCAGTGTCTCTGCCAGAGCCACAGGCAC 
NLVRALLARRASVSARATGT 
670 690 710 

TGCCTTCCGCCGTAGTCCCCGCAACCTCATCTACTTTGGGGAGCACCCTTTGTCCTTTGC 
AFRRSPRNLIYFGEHPLSFA 
730 750 770 

TGCCTGT GT GAACAGT GAGG AGATCGTGCGG CT G CTGATTGAGCATGG AGCT G ACATCCG 
ACVNSEEIVRLLIEHGADIR 
790 810 830 

GGCCCAGGACTCCCTGGGAAACACAGTGTTACACATCCTCATCCTCCAGCCCAACAAAAC 
AQDSLGNTVLHILILQPNKT 
850 870 890 

CTTTGCCTGCCAGATGTACAACCTGTTGCTGTCCTACGACAGACATGGGGACCACCTGCA 
FACQMYNLLLSYDRHGDHLQ 
910 930 950 

GCCCCTGGACCTCGTGCCCAATCACCAGGGTCTCACCCCTTTCAAGCTGGCTGGAGTGGA 
PL DLVPNHQGLTPFKLAGVE 
970 990 1010 

GGGTAACACTGTGATGTTTCAGCACCTGATGCAGAAGCGGAAGCACACCCAGTGGACGTA 
GNTVMFQHLMQKRKHTQWTY 
1030 1050 1070 

TGG ACC ACTG AC CT CG ACTCTCTATGAC CT CACAGAG AT CGACTCCT CAGG GG ATGAGCA 
GPLTSTLYDLTEI DSSGDEQ 
1090 1110 H30 

GTCCCTGCTGGAACTTATCATCACCACCAAGAAGCGGGAGGCTCGCCAGATCCTGGACCA 
SLLELIITTKKREARQILDQ 
1150 1170 H90 

GACGCCGGTGAAGGAGCTGGTGAGCCTCAAGTGGAAGCGGTACGGGCGGCCGTACTTCTG 
TPVKELV SLKWKRYGRPYFC 
1210 1230 1250 

CATGCTGGGTGCCATATATCTGCTGTACATPATCTGCTTCACCATGTGCTGCATCTACCG 
MLGAlYLLYriCFTMCCIYR 
1270 1290 1310 
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CCCCCTCAAGCCCAGGACCAATAACCGCACAAGCCCCCGGGACAACACCCTCTTACAGCA 
PLKPRTNNRTSPRDNTLLQQ 
1330 1350 1370 

GAAGCTACTTCAGGAAGCCTACGTGACCCCTAAGGACGATATCCGGCTGGTCGGGGAGCT 
KLLQEAYVTPKDDIRLVGEL 
1390 1410 1430 

GGTGACTGTCATTGGGGCTATCATC^TCCrGCTGGTAGAGGTTCCAGACATCTTCAGAAT 
V T V I G A I IILLVEVPDIFRM 
1450 1470 1490 

GGGGGTCACTCGCTTCTTTGGACAGACCATCCTIGGGGGCCCATTCCATGTCCTCATCAT 
GVTRFFGQTILGGPFHVLI I 
1510 1530 1550 

CACCTATGCCTTCATGGTGCTGGTGACCATGGTGATGCGGCTCATCAGTGCCAGCGGGGA 
TYAFMVLVTMVNRLISASGE 

1570 1590 1610 

GGTGGTACCCATGTCCTTTGCACTCGTGCTGGGCTGGTGCAACGTCATGTACTTCGCCCG 
VVPMS FALV LGWCNVMY FAR 
1630 1650 167*0 

AGGATT CC AGATGCT AGGC CCCTT CACCATCATGATTCAGAAG ATGATTTT T GGCGACCT 
GFQMLGPFTIMIQKMIFGDL 
1690 1710 1730 

GATGCGATTCTGCrGGCTGATGGCTGTGGTCATCCTGGGCTTTGCTTCAGCCTTCTATAT 
MRFCWLMAVVILGFASAFYI 

1750 1770 1790 

CATCTTCCAGACAGAGGACCCCGAGGAGCTAGGCCACTTCITACGACTACCCCATGGCCCT 
IFQTEDPEELGHFYDYPMAL. 

1B10 1830 1850 

GTTCAGCACCTTCGAGCTGTTCCTTACCATCATCGATGGCCCAGCCAACTACAACGTGGA 
FSTFELFLTIIDGPANYNVD 

1870 1890 1910 

CCTGCCCTTCATGTACAGCATCACCTATGCTGCCTTTGCCATCATCGCCACACTGCTCAT 
LPFMYS ITYAAFAIIATLLM 
1930 1950 1970 

GCTC AAC CTCCT C ATTG CC ATGATGGG CGAC ACTC ACTGGCG AGTGGC CC ATGAGCGGG A 
LNLLIAMMGDTHWRVAHE RD 
1990 2010 2030 

TGAGCTGTGGAGGGCCCAGATTGTGGCCACCACGGTGATGCTGGAGCGGAAGCTGCCTCG 
E L WRAQ IVATTVMLERKL PR 
2050 2070 2090 

CTGCCTGTGGCCTCGCTCCGGGATCTGCGGACGGGAGTATGGCCTGGGGGACCGCTGGTT 
CLWPRSGICGREYGLGDRWF 
2110 2130 2150 

CCTG CGGGTGGAAG ACAGGCAAG AT CTCAACCGGC AGCGGATCC AACG CTACG C ACAGGC 
LRVEDRQDLNRQRIQRYAQA 
2170 2190 2210 

CT T CC ACACCCGGGGCT CTG AG G ATTTGG AC AAAGACTCAGTGG AAAAACTAG AG CT GGG 
FHTRG S E DLDKDS-VEKLE LG 
2230 2250 2270 

CTGTCCCTTCAGCCCCCACCTGTCCCTTCCTACGOXTCAGTGTCTCGAAGTACCTCCCG 
CPFSPHLSLPTPSVSRSTSR 

2290 2310 2330 

CAGCAGTGCCAATTGGGAAAGGCTTCGGCAAGGGACCCTGAGGAGAGACCTGCGTGGGAT 
S SANWERLRQGTLRRDLRGI 

2350 2370 2390 

AATCAACAGGGGTCTGGAGGACGGGGAGAGCTGGGAATATCAGATCTGACTGCGTGTTCT 

INRGLE DGESWEYQI 

2410 2430 2450 

CACTTCGCTTCCTGGAACTTGCTCTCATTTTCCTGGGTGCATCAAACAAAACAAAAACCA 

2470 2490 2510 

AACACCCAGAGGTCTCATCTCCCAGGCCCCAGGGAGAAAGAGGAGTAGCATGAACGCCAA 

2530 2550 2570 

GGAATGTACGTTGAGAATCACTGCTCCAGGCCTGCATTACTCCTTCAGCTCTGGGGCAGA 
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Fig. 7 / continuation 2 



2590 2610 2630 

GGAAGCCCAGCCCAAGCACGGGGCTGGCAGGGCGTGAGGAACTCTCCTGTGGCCTGCTCA 

2650 2670 2690 

TCACCCTTC CGAC AGG AGC ACT G C ATGTCAG AGCACTTTAAAAAC AGG CCAGCCT GCTTG 

2710 2730 2750 

GGCCCTCGGTCTCCACCCCAGGGTCATAAGTGGGGAGAGAGCCCTTCCCAGGGCACCCAG 

2770 2790 2810 

GC AGGTGCAGGG AAGTGCAGAGCT TGTGG AAAGCGTGTG AGTGAG GGAG AC AG GAACGG C 

2830 2850 2870 

TCTGGGGGTGGGAAGTGGGGCTAGGTCTTGCCAACTCCATCTTCAATAAAGTCGTTTTCG 

2890 2910 
GATCCCTAAAAAAAAAAAAAAAAAAAAAAAAAA 



MGLSLPKEKGLILCLWSKFCRWFQREESWAQSRDEQNLLQQ 

ALYDNLEAAMVLMEAAPE LVra PMTSELYEGQTALHI AVVNO^I^LVRALLARRAS VS ARATGTAFRRS PRNLI YFGEHPLS FAAC 

WSEEIVRLLIEHGADIRAQDSLGfcTTVLHILILQPNKTFA^ 

I^QKRKHTQWTYGPLTSTLYDLTEIDSSGDEQSLI£LIITTKKREM 

MCCIYRPLKPRTNNRTS PRDNTLLQQKLLQEAYVTPKDDIRLVGELVTVIGAI I ILLVEVPDI FRMGVTRFFGQT ILGGPFHVLI I 
T YAFMVLVTMVMRLI S AS GE W PMS FAL VLGWCNVM Y FARGFQMLG P FT I M I QKMI FGD LNRFCWLMAWI LG FAS AFY 1 1 FQTE D 
PEELGH FY D Y PMALFS T FE1. FLT 1 1 DG PAN YN VDL P FMY S I T YAAFAI I ATL LMLNLL IAHMG DTHWRVAHERDE LWRAQI VATT V 
MI£RKLPRCLWPRSGICGREYGLGDRWFLRVEDRQDLNRQRIQRYAQAFHTRGSEDIiDKDSVEKLELGCPFSPHLSL?TPSVSRST 
SRS SANWERLRQGTLRRDLRGI INRGIiE DGESWEYQI 
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Figure 8: 



ATGGGTTTGTCACTGCCCAAGGAGAAAGGGCTAATTCTCT 
MGLSLPKEKGLILC 
250 270 290 

GCCTATGGAGCAAGTTCTGCAGATGGTTCCAGAGACGGGAGTCCTGGGCCCAGAGCCGAG 
LWSKFCRWFQRRESWAQSRD 
310 330 350 

ATGAGCAG AACCTG CTGCAGC AGAAGAGGATCTGGGAGT CTCCTCTCCPT CT AGCT GCCA 
EQNLLQQKRIWESPLLIAAK 
370 390 410 

AAGATAATGAT GT CC AGGCCCTGAACAAGT TGCTCAAGTATGAGGATTGC AAGGTGCACC 
DNDVQALNKLLKYEDCKVHQ 
430 450 470 

AG AG AGG AGCCATG GGGGAAACAG CGCTACACATAGC AGCCCTCT ATGACAACCTGGAGG 
RGAMGETALH IAALYDNLEA 
490 510 530 

CCGCCATGGTGCTGATGGAGGCTGCCCCGGAGCTGGTCTTTGAGCCCATGACATCTGAGC 
AMV LMEAAPE LVFEPMTS E L 
550 570 590 

TCT ATGAGGGTC AG ACTGCACTG C ACATCGCTGTTGTGAACCAGAAC ATGAAC CTGGTG C 
YEGQTALHIAVVNQNMNLVR 
610 630 650 

GAGCCCTGCTTGCCCGCAGGGCCAGTGTCTCTGCCAGAGCCACAGGCACTGCCTTCCGCC 
ALLARRASVSARATGTAFRR 
670 690 710 

GTAGTCCCTGCAACCTCATCTACTTTGGGGAGCACCCTTTGTCCTTTGCTGCCTGTGTGA 
SPCNLIYFGEHPLSFAACVN 
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Fig. 8 / contin i 1 

730 750 770 

ACAGTGAGGAGATCGTG CGGCTG CTCATT GAGCATGG AGCTGACAT CCGGGCCC AGGACT 
SEE IVRLLI EHGADIRAQDS 
790 810 830 

CCCTGGGAAACACAGTGTTACACATCCTCATCCTCCAGCCCAACAAAACCTTTGCCTGCC 
LGNTVLHIL ILQPNKTFACQ 
850 870 890 

AGATGTACAACCTGTTGCTGTCCTACGACAGACATGGGGACCACCTGCAGCCCCTGGACC 
MYNLLLSYDRHGDHLQPLDL 
910 930 950 

TCGTGCCCAATCACCAGGGTCTCACCCCTTTCAAGCTGGCTGGAGTGGAGGGTAACACTG 
VPNRQGLTP FKLAGVEGNTV 
970 990 1010 

TGATGTTTCAGCACCTGATGCAGAAGCGGAAGCACACCCAGTGGACGTATGGACCACTGA 
MFQHLMQKRKHTQWTYGPLT 
1030 1050 1070 

CCTCGACTCTCTATGACCTCACAGAGATCGACTCCTCAGGGGATGAGCAGTCCCTGCTGG 
STLYDLTEI DSSGDEQSLLE 
1090 1110 1130 

AACTTATCATCACCACCAAGAAGCGGGAGGCTCGCCAGATCCTGGACCAGACGCCGGTGA 
LIITTKKREARQ ILDQTPVK 
1150 1170 1190 

AGGAGCTGGTGAGCCTCAAGTGGAAGCGGTACGGGCGGCCGTACTTCTGCATGCTGGGTG 
ELVSLKWKRYGRPYFCMLGA 
1210 1230 1250 

CC ATATATCTGCTGT ACAT CATCTGCT T CAC CATGTG CTG C ATCTACCG CCCCCTCAAGC 
IYLLYI ICFTMCCIYRPLKP 
1270 1290 1310 

CCAGGACCAATAACCGCACGAGCCCCCGGGACAACACCCTCTTAGAGCAGAAGCTACTTC 
RTNNRTSPRDNTLLQQKLLQ 
1330 1350 1370 

AGGAAGCCTACATGACCCCTAAGGACGATATCCGGCTGGTCGGGGAGCTGGTGACTGTCA 
EAYMTPKDD I RLVGELVTVI 
1390 1410 1430 

TTGGGGCTAT CATCATCCT GCTGGT AG AGGTT CCAGACAT CTTCAGAAT GGGGGTCACT C 
G A I IILLVEV PDIFRMGVTR 
1450 1470 1490 

GCTTCTTTGGACAGACCATCCTTGGGGGCCCATTCCATGTCCTCATCATCACCTATGCCT 
FF.GQTI LGGPFHVLI ITYAF 
1510 1530 1550 

TCATGGTGCTGGTGACCATGGTGATGCGGCTCATCAGTGCCAGCGGGGAGGTGGTACCCA 
MVLVTMVMRL I SASGEVVPM 
1570 1590 1610 

TGTCCTTTGCACTCGTGCTGGGCTGGTGCAACGTCATGTACTTCGCCCGAGGATTCCAGA 
SFALVLGWCNVMYFARGFQM 
1630 1650 1670 

TGCTAG GCCCCTT CACCAT CATGAT T C AG AAGATGATTT-TTGGCGACCTGATGCGATTCT 
LGF FTIMI QKMIFG DLMRFC 
1690 1710 1730 

GCTGGCTGATGGCTGTGGTCATCCTGGGCTTTGCTTCAGCCTTCTATATCATCTTCCAGA 
WLMAVV ILG FASAFYI IFQT 
1750 1770 1790 

CAGAGGACCCCGAGGAGCTAGGCCACTTCTACGACTACCCCATGGCCCTGTTCAGCACCT 
EDPEELGHFY DYPMAL FSTF 
1810 1830 1850 

TCGAGCTGTTCCTTACCATCATCGATGGCCCAGCCAACTACAACGTGGACCrGCCCTTCA 
ELFLTI IDGPANYNVDLPFM 
1870 1890 1910 

TGTACAGCATCACCTATGCTGCCTTTGCCATCATCGCCACACTGCTCATGCTCAACCTCC 
YSITYAAFAI IATLLMIjNLL 
1930 1950 1970 

TCATTGCCATGATGGGCGACACTCACTGGCGAGTGGCCCATGAGCGGGATGAGCTGTGGA 
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Fig. 8 / conti< Dn 2 

IAMMGDT HWRVAHE RDBLWR 
1990 2010 2030 

GGGCCCAGATTGTGGCX^CCACGGTGATGCTGGAGCGGAAGCTGCCTCGCTGCCTGTGGC 
AQI VATT VMLE RKL PRCLWP 
2050 2070 2090 

CTCGCTCCGGGATCTGCGGACGGGAGTATGGCCTGGGAGACCGCTGGTTCCTGCGGGTGG 
RSGI CGREYGLGDRWFLRVE 
2110 2130 2150 

AAGAC AGG C AAG ATCTCAAC CGG CAG CGGATCCAACG CTACGCACAGG CCTT CC AC ACC C 
DRQDLNRQRIQRYAQAFHTR 
2170 2190 2210 

GGGGCTCTGAGGATTTGGACAAAGACTCAGTGGAAAAACTAGAGCTGGGCTGTCCCTTCA 
GSE DLDK DSVEKLELGCPFS 
2230 2250 2270 

GCCCCCACCTGTCCCTTCCTATGCCCTCAGTGTCTCGAAGTACCTCCCGCAGCAGTGCCA 
PHLSLPMPSVSRSTSRSSAN 
2290 2310 2330 

ATTGGGAAAGG CTTCGG CAAGGG ACCCTGAGGAG AGACCTGCGTGGGAT AATCAAC AGGG 
WERLROGTLRRDLRGI INRG 
2350 2370 2390 

GTCTGGAGGACGGGGAGAGCTGGGAATATCAGATCTGA 
LEDGESWEYQI* 



MGLSLPKEKGLILCLWSKFCRWFQRRESWAQSRDEQNLI^QKRIWESPI^LAAKDNDVOALNKLLKYEDCK^ 
ALYDNLEJ\AMVLMEAAPELVFEFOTSELYEGQTALHIAWNQNM^ 
VNSEEIVRLLIEHGADIRAQDSLGNTVIJlILILQPNKTFACQMYNLLLSYra 
I^QKRKHTQWTYGPLTSTLYDLTEIDSSGDEQSLLELIITTKKREARQILDQTFV^ 

MCCIYRPLKPRTNNRTSPRDNTLLQQKLU3EAYMTPKDDIRLVGEIiVTVIGAIIILLVEVPDIFRMGVTRFFGQTILGGPFHVLII 
TYAFMVLVTMVMRLISASGEWPMS FALVLGWCNVMYFARGFQMLGPFTIMIQKMI FGDLMRFCWLMAWI LGFASAFYI I FQTED 
PEELGHFYDYPMALFSTFELFLTIIDGPANYNVDLPFMYSITYAAFAIIATLLMLN 

tCLERKLPRCLWPRSGICGREYGLGDRWFLRVEDRQDLNRQRIQRYAQAFHTRGSEDLDKDSVEKLELGCPFS PHIjSIiPMPSVSRST 
S RS S ANWERLRQGTLRR D LRG I INRG LE DGE S WE YQI 



CAAACTCACAGCCCrrCTCCAAACTGGCTGGGGOTGCTGGGAGACrCCCAAGGAACTCGTCAGGAAGGCAGGAGACAGGAGACGGGA 
CCTCTAC^GGGAGACGGTGGGCCGGCCCTTGGGGGGGCTGATGTGGCCCC^GGCTGAGTCCCGT<^GGGTCTGGCCTCGGCCTCA 
GGCCCX:CAAGGAGCCGGCCCTACACCCCATGGGTTTGTC^CTGCCCAAGGAGAAAGGGCTAATTCTCrGCCTATGGAGCAAGTTCT 
GCAGATGGT TCCAG AG ACGGG AGT CCT GGGCCCAG AG CCG AGATGAGC AGAACCTGCTGCAGCAG AAG AGGAT CTGGGAGT CTCCT 
CTCCTTCTAG CT GCCAAAG AT AATGAT GT CCAGGCCC TGAACAAGTTGCT C AAGTATGAGGATTGCAAGGTGC ACCAG AG AGGAGC 
CATGGGGGAAACAGCG CT ACACAT AG C AGCCCTCT AT GACAACCTGGAGGC CGCCAT GGTGCTG ATGGAGGCTGCCCCGGAGCTGG 
TCTTTGAGCCCATGACATCTGAGCTCTATGAGGGTCAGACTGCACTGCACATCGCTGTTGTGAACCAGAACATGAACCTGGTGCGA 
GCCCTGCTTGCCCG(mGGGCCAGTGTCTCTGCCAGAGCCACAGGCACTGCCTTCCGCCGTAGTCCCa3CAACCTCATCTACTTTGG 
GGAGCACCCTTTGTCCTTTGCTGCCTGTGTGAACAGTGAGGAGATCGTGCGGCTGCTCATTGAGCATGGAGCTGACATCCGGGCCC 
AG GACTGCCTGG CCCAAC AAAACCTT TGC CTG CC AGATGTACAACCTTCTTG CTGT CCT ACG AC AGACATG GGGACC ACCT GC AGCC 
CCTGGACCTCGTGCCCAATCACCAGGGTCTCACCCCTTTCAAGCTGGCTGGAGTGGAGGGTAAC^CTGTGATGTTTCAGCACCTGA 
TGCAGAAGCGGAAGCACACCCAGTGGACGTATGGACCACTGACCTCGACTCTCTATGACCTCACAGAGATCGACTCCTCAGGGGAT 
GAGCAGTCCCTGCTGGAACTTATCATCACCACCAAGAAGCGGGAGGCTCGCGA^ 

GAGCCTCAAGTGGAAGCGGTACGGGCGGCCGTACTTCTGCATGCTGGGTGCCATATATCIGCTGTACATCATCTGCTTCACCATGT 
GCTGraTCTACCGCCCCCTC^.GCCCAGGACCAATAACCGCACAA^^ 

GAAGCCTACGTGACCCCTAAGGACGATATCCGGCTGGTCGGGGAGOTGGTGACTGTCATTGGGGCTATCATCATCCTGCTGGTAGA 

GGTTCCAGAC AT CTT CAG AATGGGGGTC ACT CGCTT CTTT GGACAGACC AT CCTT GGGGGCCCATTCCAT GTCCT C AT C ATCACCT 
ATGCCTTC^TGGTGCTGGTGACCATGGTGATGCGGCTC^TATGATTTTTGGCGACCrGATGCGATTCTGCTGGCTGATGGCTGTGG 
TCATCCTGGGCTTTGCTTCAGCCTTCTATATCATCTTCCAGACAGAGGACCCCGAGGAGCTAGGCCACTTCTACGACTACCCCATG 
GCCCTGTTCAGCACCTTCGAGCTGGTCCTTACCATCATCGATGGCCCAGCCAACTACAACGTGGACCTGCCCTTCATGTACAGCAT 
C^CCTATGCTGCCTTTGCCATCATCGCC^CACTGCTCATGCTCAACCTCCTCATTGCCATGATGGGCGACACTCACTGGCGAGTGG 
CCCATGAGCGGGATGAGCTGTGGAGGGCCCAGATTGTGGCCACCACGGTGATGCTGGAGCGGAAGCTGCCTCGCTGCCTGTGGCCT 
CGCTCCGGGATCTGCGGACGGGAGTATGGCCTGGGGGACCGCTGGTTCCTGCGGGTGGAAGACAGGCAAGATCTCAACCGGCAGCG 
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GATCCAACGCTACGCACAGGCCTTCCACACCCGGGGCTCTGAGGATTTGGACAAAGACTCAGTGGAAAAACTAGAGCTGGGCTGTC 

CCTTCAGCCCCCACCTGTCCCTTCCTACGCCCTCAGTGTCTCGAAGTACCTCCCGCAGCAGTGCCAATTGGGAAAGGCTTCGGCAA 

GGGACCCT GAGGAGAGACCTG CGTGGGATAAT CAAC AGGGGTCTGGAGGACGGGG AGAGCTGGGAATATCAGATCTGACTGCGTGT 

TCTCACTTCGCTTCCTGGAACTTGCTCTGATTTTCCTGGGTGCATCAAACAAAACAAAAACCAAACACCCAGAGGTCrCATCTCCC 

AGGCCCC AGGG AGAAAG AGGAGT AGCATGAACGCC AAGG AATGT ACGT TGAG AAT CACTGCT CCAGGCCTGC AT T ACTCCTT CAGC 

TCTGGGGCAGAGGAJVGCCCAGCCC^AGCACGGGGCTGGCAGGGC^ 

GAGCACTGCATGTCAGAGCACTTTAAAAACAGGCCAGCCTGCTTGGGCCCTCGGTCT 

CCCTOCCCAGGGCACCCAGGCAGGTGCAGGGAAGTGCAGAGCTTGTGGAAAG 

GTGGGAAGTGGGGCTAGGTCTTGCCAACTCCATCTTCAATAAAGTCGTTTTCGGATCCCTAAZ^ 



CAAACTCACAGCCCTCTCC^AACTGGCTGGGGCTGCTGGGAGACTCCCAAGGAACTC^ 

CCTCTACAGGGAGACGGTGGGCCGGCCCTTGGGGGGGCTGATGTGGCCCCAAGGCTGAGTCCCGTCAGGGTCTGGCCTCGGCCTCA 
GGCCCCCAAGGAGCCGGCCCTACACCCCATGGGTTTGTCACTtGCCCAAGGAGAAAGGGCTAATTCTCTGCCTATGGAGCA 
GCAG AT GGTTCCAGAGACGGGAGTCCTGG GCCCAGAGCCGAG ATGAG CAGAACCTGCT GC AGCAGAAG AGG AT CTGGGAGTCT CCT 
CTCCTTCTAGCTGCCAAAG AT AATGATGT C(^GG CCCTGAACAAGTTGC^ CAAGTA 

CATGGGGGAAACAGCGCTACACATAGCAGCCCTCTATGACAACCTGGAGGCCGCCATGGTGCTGATGGAGGCTGCCCCGGAGCTGG 

TCTTTGAGCCCATGACATCTGAGCTCTATGAGGTCCTGACTGCCCATCACTTGAACGCCTGCCCCCTGAAATGCCAGGGCCTAGAG ' 

AAGAGG AAG AG ATGG GCAGC AGCTGG ATCCCCTGGGAATCCTG AAC ACCCG AGAGCT CCCTGTT CT CC AT C CC AGGCT ACCCCT GA 

GGGAAAGAGACTGGGGTGCATATGGGAGGGACCCCCTGCAGGATCCTGGGGACAGACCCGTGACTGACAGCTGTCTCTGGGCCAGG 

TCAGACTGCAOTGCACATCGCTGOTGTGAACCAGAACATGAACCT 

GAGCCACAGGCACTGCCTTCCGCCGTAGTCCCTGC^ 

AGTGAGGAGATCGTGCGGCTGCTmTTGAGCATGGAGCTGACATCCGGGCCCAGGACTCCCTGGCCCAACAAAACCTTTGCCTGCC 
AGAT GT AC AACCTGTT GOT GTCCTACGACAGACATGGGGACCACCT GCAGCCCCTGGACCTCGTG C CCAAT CACCAGG G T CTCACC 
CCTTT CAAGCTGGCT GG AGT GGAGGGT AACACT GTGATGTTT CAGC ACCTGATGCAGAAGCGGAAGC AC ACCCAGTGGACGT ATGG 
ACCACTGACCTCGACT CTCT ATGACCT CACAG AGATCG ACT CCTCAGGGGATGAGCAGT CCCTGCT GG AACTTATCATCAC C ACCA 
AGAAGCGGG AGG CT CGCCAGATCCTG G ACCAG ACGCCGGTG AAGGAGCTGGTGAGCCT C AAGTGGAAGCGGT ACGGGCGG CCGTAC 
TTCTGCATGCTGGGTGCGATATATCTGCTGTACATCATCTGCTTCACCATGTGCTGCATCTACCGCCCCCTCAAGCCCAGGACCAA 
T AACCG CACG AGCCC CCX3GG ACAACACCCTCTTACAG(2AGAAGCT ACTT(^^ CCGGC 
TGGT CGGGGAGCTGGTGACT GTCATT GGG GCT ATCAT CATCCT GCTGGT AGAGGTTCC AG ACAT CTT CAG AATGGGGGT C ACTCGC 
TTCTT T GGACAG ACCATCCT TGGGGG CCCATT CCATGTCCTC ATC ATCACCTATGCCTT CATGGTGCTGGT GACCATGGT G ATGCG 
GCTC^TC^GTGCCAGCGGGGAGGTGGTACCCATGTCCTTTGCACT^ 
TCCAGATGCTAGGCCCCTTCACC&TCATGATTGAGAAGATGATTTTTGGCGAC^ 

ATCCTGGGCTTTGCTTAGACAGAGGACCCCGAGGAGCTAGGCCACTTCTACGACTACCCCATGGCCCTGTTCAGCACCTTCGAGCT 
GGTCCTTACCATCATCGATGGCCCAGCCAACTACAACGTGGACCTGCCCTTCATGTACAGCATCACCTATGCrGCCTTTGCCATCA 
TCGCCACACTGCTCATGCTCAACCTCCTCATTGCC^TGATGGGCGACACTCACTGGCGAGTGGCCC^TGAGCGGGATGAGCTGTGG 
AGGGCCCAGATTGTGGCCACCACGGTGATGCTGGAGCGGAAGCTGCCTCGCT 

GTATGGCCTGGGAGACCGCTGGTTCCTGCGGGTGGAAGACAGGCAAGATCTCAACCGGCAGCGGATCCAACGCTACGCACAGGCCT 

TCCACACCCGGGGCTCTGAGGATTTGGACAAAGACTCAGTGGAAAAACT^^ 

CCTATGCCCT^GTGTCTCGAAGTACCTCCCGCAGCTVGT^^ 

TGGGAT AAT CAACAGGGGT CTGGAGGACGGGGAGAGCTGGGAAT AT CAG ATCTGACT GCGTGTTCTCACTTCGCTT CCTGG AACTT 
GCTCTCATTTTCCTGGGTGCATCAAACAAAACAAAAACC^^ 

AGC ATG AACGCCAAG GAATGT ACGTT GAG AAT CACTGCTCCAGGCCTGCATT ACTCCTT C AGCTCTGGGGCAGAGGAAG CCC AGCC 
CAAGCACGGGGCTGGCAGGGCGTGAGGAACTCTCCTGTGGCCTGCTCATCACCCTTCCGACAGGAGCACTGCATGTCAGAGCACTT 
TAAAAACAGGCCAGCCTGCTTGGGCCCTCGGTCTCCACCCCAGGGTCIATAAGTGGGGAGAGAGCCCTTCCCAGGGCACCCAGGCAG 
GTGCAGGGAAGTGCAGAGCTTGTGGAAAGCGTGTGAGTGAGGGAGACAGGAACGGCTCTGGGGGTGGGAAGTGGGGCTAGGTCTTG 
CCAACTCCATCTTCAATAAAGTCGTTTTCGGATCCCTAAAAAAAAAAAAAAAAAAAAAAAAAA 



CAAACTCACAGCCCTCTCCAAACTGG<^GGGGCTGCTGGGAGACTCCCAAGGAACT 

CCTCTACAGGGAGACGGTGGGCCGGCCCTTGGGGGGGCTGATGTGGCCCCAAGGCTGAGTCCCGTCAGGGTCTGGCCTCGGCCTCA 
GGCCCCCAAGGAGCCGGCCCTACACCCCATGGGTTTGTCACTGCCCAAGGAGAZ^GGGCTAATTCTCTGCCTATGGAGCAAGTTCT 
GCTVGATGGTTCCAGAGACGGGAGTCCTGGGCCCAGAGCCGAGATGAGCAGAACCTGCTGCAGGAGAAGAGGATCTGGGAGTCn'CCT 
CrrCCTTCTAGCTGCCAAAGATAATGATGTCCAGGCCCTGAACAAGTTGCTCAAGTATGAGGATTGCAAGGTGCACCAGAGAGGAGC 
(^TGGGGGAAACAGCGCTACACATAGCAGCCCTCTATGACAACCTGGAGGCCGCCATGGTGCTGATGGAGGCTGCCCCGGAGCTGG 
TCTTTGAGCCCATGACATCTGAGCTCTATGAGGGTCAGACTGCACTGCACATCGCTGTTGTGAACCAGAACATGAACCTGGTGCGA 
GCCCTGCTTGCCCGCAGGGCCAGTGTCTCTGCCAGAGCCACAGGCACTGCCTTCCGCCGTAGTCCCCGCAACCTCATCTACTTTGG 
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AAACACAGTGTTACACATCCTCAT^ 

ATGGGGACCACCTGCAGCCCCTGGACCTCGTGCCCAATCACCAG^^ 

GTGATGTTTC AGC ACCTG ATG CAQAAG CGG AAGCAC ACCCAG TGGACGT ATGG ACCACTGACCTCGACT CTCT ATGACCT CACAGA 

GATCGACT CCT CAGGGGATGAG C AGTCCCTGCTGG AACTTAT CATCACCACC AAGAAGCGGG AGGCTCG CCIAG ATGCTGG ACCAG A 

O^CGGTGAAGGAGCTGGTGAGCCTCAAGTGGAAGCGGTACGGGC^ 

ATCATCTGCTTCACCATGTGCTGCATCTACCGCCCCXrr^ 

ACAGCAGAAGCTACTTCAGGAAGCCTACGTGACCCCTAAG 

TCATCATCCTGCTGGTAGAGGTTOCAGACATCTTCAGAATGGGGGTCACTCGCTTCT 

CATGTC CT CATCATCACCTATG CCTTC ATGGTGCTGGTGACC ATGGTGATGCGGCT C ATCAGTGCCAGCGGGGAGGTGGTACCC AT 
GTCCTTT GC ACT CGTGCTGGGCTGGT GC AACGT CATGT ACTTCGCCCG AGG ATTCC AG ATGCT AGGCCCCTT C ACCATC ATGATTC 
AGAAGATGATTTTTG^CGACCTGATGCGATTCTGCHX^CT^ 

TTCCAGACAGAGGACCCC^AGGAGCTAGGCC^CTTCrrACXACTACCCCATGGCCCTGTTCAGra 

C ATCG ATGGC CC^GCC AACTAC AACGTCGACCTGCCCTTC ATGT AC^GCATCACCT ATGCTGCC^TTGCCAT CATCG CCACACTG C 
TCATGCTCAACCTCCTCATTGCCATGATGGGCGACACTCACTGGCGilGTC 

GTGGCCACCACGGTGATGCTGGAGCGGAAGCTGCCTCGCTGCCTGTGGCCTCGCTCCGGGATCTGCGGACGGGAGTATGGC 
GGACCGCT GGTTCCTGCGGGTGGAAGACAGGC^^GATCTCAACCGGCAGCGGATCCAACGCT 
GCTCTGAGGATTTGGACAAAGACTCAGTGGAAAAACTAGAGCT 

GTGTCTCGAAGT ACCT CCCG CAGCAGT GCXIAATTGGGAAAGGCTTCGGCAAGGGACCCTGAGGAGAGACCTGCGTGGGATAATCAA 

C TGGGTGCAT C AAAC AAAACAAAAACCAAAC ACCCAGAGGTCTCATCTCCCAGGCCCCAGGGAGAAAGAGGAGT AGC ATGAACGCC ' 
AAGG AATGT ACGTTGAGAATC ACT G CT CCAGGCCTGC ATT ACTCCTTCAGCTCTGGGG Q\GAGGAA 

TGGCAGGGCGTGAGGAACTCTCXrrGTGGCCTGCTCATCACCCTTCCGA^GGAGCACTGCATGTCAGAGCACTTTAAAAACA 
AGC CTGCTTGGGCCCTCGGTCTCCACCCCAG GGTCATAAGTGGG GAG AGAGCCCTT CCCAGGGCACCCAGGC AGGTG CAGGGAAGT 
G(^GAGCTTGTGGAAAGX^GTGTGAGTGAGGGAGACAGGAACGGCTCTGGGGGTGGGAAG 
T CAAT AAAGTCGTT TTCGGATCCCTAAAAAAAAAAAAAAAAAAAAAAAAAA 



CACACATGGGGCCTCCC^GGAGTGCCCAGGACCTCGTGCTGTTGGCCTCTGAATCTATCCT 

CATATAACCCACCTCTXTTGTAAATGCCAGGAGCC^^ 

CCATGGTGCTGATGGAGGCTGCCCCGGAGCTGGTCTTTGAGCCCOT 

GGGTG AAG AG CAGGAGTG ACGTGGTTGGGTATTCAAGT CAGTCTCTOTGATGGATAATrTGGGAAAGACACAGGGG ATCTG AGCCT 

CCTACTCTTTTTSTCTTCTCTGTCTCCCTTCCXn'GTCAGTC^ 

GCCTAGAGAAGAGGAAGAGATGGGCAGCAGCTGGJVTCCCCTGGGAATCCTG^^ 

ACCCCTGAGGGAAAGAGACTAGGGGTGCATATGGGAGGGACCCCCTGCAGGATCCTAGGGGACAGACCCGTGACTGACAGCTGTCT 
CTGGGCCAGGTCAGACTGCACTGCACATCGCTGTTGTGAACCAGAACATGAACCTGGTGCGAGC 

GTCTCTGCCAG^GCCACAGGCACTGCCTTCCGCCGTAGTCCCTGCAACCTCATCTACTTTGGGGAGCACCCTTTGTCCTT^ 
CTGTGTGAACAGTGAGGAGATCGTGCG^CTGCTCATTGAGCATGGAGCTGACATCCGGXXJCCAGGACTCCCTGGATGTACAACCTG 

TTGCTGTCCTACGACZAGAC^TGGGGACCACCTrGCAGCC^ 

TGGAGTGGAGGGTAACACTCTGATGTTTCAGC^CCTGJVTGCAGAAGCGGAAGCACACCX^GTGGAC^ 

CTCTCTATGACCTCACAGAGATCGACTCCTCAGGGGATGAGCAGTCCCTGCTGGAACTTATCATCACCACCAAGAAGCGGGAGGCT 
CGCCAGATCCTG G ACCAGACGCCGGT GAAGGAGCTGGTGAGCCT CAAGTGG AAG CGGTACGGGCG GCCGTACTTCTG CATGCTGGG 
TGCCATATATCTGCTGTACATCATCTGCTTCACCATGTGCTGCATCTACCGCCCCCTC^GCCCAGGACCAATAACCGCACGAGCC 
CCCGGGACAAGACCCTCTTACAGCAGAAGCTACTTCAGGAAGCCTACATGACCCCTAAGGACGATA 

GTGACTGTCATTGGGGCTATCATCATCCTGCTGGTAGAGGTTCCAGACATCTTCAGAATGGGGGTCACTCGCrTTCTTTGGACAGAC 

CATCCTTGGGGGCCC^TTCCATGTCCTCATCATCACCTATGCCTTCATGGTGCTGGTG^CCATGGTGATGCGXSCTCATCACTGCCA 

GCGGGGAGGTGGTACC^TGTCCTTTGCACTCGTGCTGGGCTGGTGCAACGTCATGTACTTCGCCCGAGGATTCCAGATGCTAGGC 

CCCTTCACCTTCATGATTCAGAAGATGATTTTTGGCGACCTGA^ 

TTC^GCCTTCTATATCATCTTCCAGACAGAGGACCCCGAGGAG^ 

TCGAGCTGGTCCTTACCIATCATCGATGGCCCAGCCAACTACAACGTGGACCTGCCCTTCATGTACAGCATCACCTATGCTGCCTTT 
GCCATCATCGCCACACTGCTCATGCTCAACCTCCTCATTGCCATGATGGGCGACACTCACTGGCGAGTGGCCCATGAGCGGGATGA 
GCTGTGGAGGGCCCAGATTGTGGCCACCACGGTGATGCTGGAGCGGAAGCTGCCTCGCTGCCTGTGGCCTCGCTCCGGGATCTGCG 
GACGGGAGTATGGCCTGGGAGACCGCTGGTTCCTGCGGGTGGAAGACAGGCAAGATCTCAACCGGCAGCG^ATCCAACGCTACGCA 
CAGGCCTTCCACACCCGGGGCTCTGAGGATTTGGACAAAGACTCAGTGGAAAAACTAGAGCTGGGCTGTCCCTTCAGCCCCCACCT 
GTCCCTTCCTATGCCCTCAGTGTCTCGAAGTACCTCCCGCAGCAGTGCCAATTGGGAAAGGCTTCGGCAAGGGACCCTGAGGAGAG 
ACCTGCGTGGGATAATCAACAGGGGTCTGGAGGACGGGGAGAGCTGGGAATATCAGATCTGACTGCGTGrrCTCACTTCGCTTCCT 
GGAACTTGCTCTCATTTTCCTGGGTGCATCAAACAAAACAAAAACCAAACACCCAGAGGTCTCATCTCCCAGGCCCCAGGGAGAAA 
GAGGAGTAGCATGAACGCC^GGAATGTACGTTGAGAATCACTGCTCCAGGCCTGCATTACTCCTTCAGCTCTGGGGCAGAGGAAG 
CCCAGCCCAAGCACGGGGCTGGCAGGGCGTGAGGAACTCTCCTGTGGCCTGCTCATCACCCTTCCGACAGGAGCACTGCATGTCAG 
AG C ACT TT AAAAAC AGGCC AG CCTGCTTGGGC CCT CGGTCTCCACCCC AG GGTCATAAGTGGG GAG AG AGC CCT TCCCAGGGCACC 
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CAGGCAGGTGCAGGGAAGTGCAGAGCTTGTGGAAAGCGTGTGAGTGAGGGAGACAGGAACGGCTCTGGGGGTGGGAAGTGGGGCTA 
GGTCTTGCCAACTCCATCTTCAAT&AACT 
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Figure 9: 
A. 



10 30 50 

CGGGGCCCTGGGCTGCAGGAGGTTGCGGCGGCCGCGGCAGCATGGTGGTGCCGGAGAAGG 

M V V P E K E 
70 90 110 

AG C AG AG CTGG ATCC CCAAG ATCT T CAAG AAG AAGACCTGCACG ACGTT CAT AGTT G ACT 
QSWIPKI FKK KTCTTFIVDS 
130 150 170 

CCACAGATCCGGGAGGGACCTTGTGCCAGTGTGGGCGCCCCCGGACCGCCCACCCCGCAG 
TDPGGTLCQCGRPRTAHPAV 
190 -210 230 

TGGCCATGGAGGATGCCTTCGGGGCAGCCGTGGTGACCGTGTGGGACAGCGATGCACACA 
* AME D A F. G A A V VTVWDS D A H T 

250 270 290 

CCACGGAGAAGCCCACCGATGCCTACGGAGAGCTGGACTTCACGGGGGCCGGCCGCAAGC 
TEKPTDAYGE LDFTGAGRKH 
310 330 350 

ACAGCAATTOCCTCCGGCTCTCTGACCGAACGGAT<XIAGCTGC^GTTTATAGTCTGGTCA 
SNFLRLS DRT DPAAVYSLV-T 
370 390 410 

CACGCACATGGGGCTTCCGTGCCCCGAACCTGGTGGTGTCAGTGCTGGGGGGATCGGGGG 
RTWGFRAPN LVVSVLGGSGG 
430 450 470 

GCCCCGTCCTCCAGACCTGGCTGCAGGACCTGCTGCGTCGTGGGCTGGTGCGGGCTGCCC 
PVLQTWLQ D L LRRGLVRAAQ 
490 510 530 

AGAGCACAGGAGCCTGGATTGTCACTGGGGGTCTX3CACACGGGCATCGGCCGGCATGTTG 
STGAWIVTGG LHTGIGRHVG 
550 570 590 

GTGTGGCTGTACGGGACCATCAGATGGCCAGCACTGGGGGCACCAAGGTGGTGGCCATGG 
VAVRDHQMAS TGGTKVVAMG 
■ 610 630 650 

GTGTGGCCCCCTGGGGTGTGGTCCGGAATAGAGACACCCTCATCAACCCCAAGGGCTCGT 
VAPWGVVRN RDTLINPKG SF 
670 690 710 

TCCCTGCGAGGTACCGGTGGCGCGGTGACCCGGAGGACGGGGTCCAGTTTCCCCTGGACT 
PARYRWRGD PEDGVQFPLDY 
730 750 770 

ACAACTACTCGGCCTrCTTCCTGGTGGACGACGGCACACACGGCTGCCTGGGGGGCGAGA 
NYSAFFLV D DGTHGCLGGEN 
790 810 830 

ACCGCTTCCGCTTGCGCCTGGAGTCCTACATCTCACAGCAGAAGACGGGCGTGGGAGGGA 
RFRLRLESYISQQKTGVGGT 

850 870 890 

CTGGAATTGACATCCCTGTCCTGCTCCTCCTGATTGATGGTGATGAGAAGATGTTGACGC 
GIDIPVLLLLID. GDEKMLTR 

910 930 950 

GAATAGAGAACGCCACCCAGGCTCAGCTCCCATGTCTCCTCGTGGCTGGCTCAGGGGGAG 
IENATQAQLPCLLVAGSGGA 

970 990 1010 

CTGCGGACTGCCTGGCGGAGACCCTGGAAGACACTCTGGCCCCAGGGAGTGGGGGAGCCA 
ADCLAETLE DTLAPGS GGAR 
1030 1050 1070 

GGCAAGGCGAAGCCCGAGATCGAATCAGGCGTTTCTTTCCCAAAGGGGACCTTGAGGTCC 
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QGEARDRIRRFFPKGDLEVL 
1090 1110 1130 

TGCAGGCCCAGGTGGAGAGGATTATGACCCGGMGGAGCTCCTGACAGTCTATTCTTCTG 
QAQVSRIMTRKELLTVYS SE 
1150 1170 1190 

AGGATG<X5TCTGAGGAATTCGAGACCATAGTTTT<3AAGGCCCTTGTGAAGGCCTGTGGGA 
DGSE EFETlVliKALVKACGS 
1210 1230 1250 

GCTCGGAGGCCTCAGCX^ACCTGGATGAGCTGCGTTTGGCTGTGGCTTGGAACCGCGTGG 
SEAS AYIiDELF. LAVAWNRVD 
1270 1290 1310 

AC ATTGCCC AG AGT GAACTCTTTCGGGGGGAC ATCC AATGGCGGTCCTT C CAT CTCG AAG 
IAQS ELFRGDIQWRSFHLEA 
1330 1350 1370 

CTTCCCTCATGGACC^CCTGCTGAATGACCGGCCTGAGTOCGTGCGCTTGCrCATTTC^ 
SLMDA1LNDRPEFVRLLISH 
1390 1410 1430 

ACGGCCTCAGCCTGX3GCCACTTCCTGACCCCGA7GCGCCTGGCCCAACTCTACAGCGCGG 
GLS LGHFLTPNRLAQLYS'AA 
1450 1470 1490 

CGCCCTCCAACTCGCTCATCCGCAACCTTTTGGACCAGGCGTCCCACAGCGCAGGCACCA 
PSN SLIRNLLDQASHSAGTK 
1510 1530 1550 

AAGCCCCAGCCCTAAPJIGGGGGAGCTGCGGAGCTCCGGCCCCCTGACGTGG3GCATGTGC 
APALKGGAAELRPPDVGHV L 
1570 1590 1610 

rGAGGATGCTGCTGGGGAAGATGTGCGCGCCGAGGTACCCCTCCGGGGGCGCCTGGGACC 
RMLLGKMCAPRYPSGGAW DP 
1630 1650 1670 

CTCACCCAGGCCAGGGCTTCGGGGAGAGCATGTATCTGCTCTCGGACAAGGCCACCTCGC 
HPGQGFGESMYLLSDKATSP 
1690 1710 1730 

CGCTCTCGCTG^TGCTGGCCTCGGGCAGGCCCCCTGGAGCGACCTGCTrCTTTGGGCAC 
LSIj DAGLGQAPWSDLLLWAL 
1750 1770 1790 

TGTTGCTGAACAGGG CACAG ATGGCCATGTACTT CTGGGAGATGGGTT CCAATGC AGTTT 
LLNRAQMAMYFWEMGSNAVS 
1810 1830. 1850 

CCTCAGCTCTTGG^GCCTGTTTGCTGCTCCGGGTGATGGCACGCCTGGAGCCTGACGCTG 
SALGACLLLRVMARLEPDAE 
1870 1890 1910 

AGGAGG CAG CACGG AGGAAAGACCTGG CGTT C AAGT rTGAGGGGATGGGCGTTGACCT CT 
EAARRKDLAFKFEGMGVD LF 
1930 1950 1970 

TTGGCGAGTGCTATCGCAGCAGTGAGGTGAGGGCTGCCCGCCTCCTCCTCCGTCGCTGCC 
GECYRSSEVRAARLLLRRCP 
1990 2010 2030 

CGCTCTGGGGGGATGCCACTTGCCTCCAGCTGGCCATGCAAGCTGACGCCCG1GCCTTCT 
L W G DA TCLQLAMQADARA FF 
2050 2070 2090 

TTGCCCAGGATGGGG7ACAGTCTCTGCTGACACAGAAGTGGTGGGGAGATATGGCCAGCA 
AQDGVQSLLTQKWWGDMAST 
2110 2130 2150 

CTACACCCATCTGGGCCCTGGTTCTCGCCTTCTrTrGCCCTCCACTCATCTACACCCGCC 
TPIWALVLAFFCPPLIYTRL 
2170 2190 2210 

tc atc acctt c agg aaat c ag aag aggag cc c ac acggg agg agctagagt ttgacat gg 
itfrbcseeeptreelefdmd 

2230 2250 2270 

ATAGTGTCATTAATGGGGAAGGGCCTGTCGGGACGGCGGACCCAGCCGAGAAGACGCCGC 
SVINGEGPVGTADPAEKT PL 
2290 2310 2330 
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TGGGGGTCCCGCGCCAGTCGGGCCGTCCGGGTTGCTGCGGGGGCCGCTGCGGGGGGCGCC 
GVPRC'SGRPGCCGGRCGGRR 
2350 2310 2390 

GGTGCCTACGCCECTGGTTCC^CTTCTGGGGCGTGCCGGTGACCATCTTCAiroGGCAACG 
CLRRWFHFWGVPVTIFMGNV 
2410 2430 2450 

TGGTCAGCTACCTGCTGTTCCTGCTGCTTTTCTCGCGGGTGCTGCTCGTGGftTTTCCAGC 
VSYLLFLLLFSRVLLVDFQP 
2470 2490 2510 

CGGCGCCGCCCGGCTCCCTGGAGCTGCTGCTCTA7TTCTGGGCTTTCACGCTGCTGTGCG 
APPGSLELL1YFWAFTLLCE 
2530 2550 2570 

AGGAACTGCGCCAGGGCCTGAGCGGAGGCGGGGGCAGCCTCGCCAGCGGGGGCCCCGGGC 
ELRQGLSGGGGS LASGGPGP 
2590 2610 2630 

CTGGCCATGCCTCi\CTGAGCCAG<IGCCTGCGCCTCTACCTCGCCGACAGCTGGAACCAGT 
GHASLSQRLRLYLADSWNQC 
2650 2670 2690 

GCGACCTAGTGGCTCTCACCTGtTTTCCTCCTGGGCGTGGGCTGCCGGCT'GACCCCGGGTT 
DLVALTCFLLGVGCRLTPGL 
2710 2730 2750 

TGTACCACCTGGGCCGCACTGTCCTCTGCATCGACTTCATGGTTTTCACtSGTGCGGCTGC 
YH.LGRTVLCI DFMVFTVRLL 
2770 2790 2810 

TT C AC ATCTT GACGGTCAACAAACAGCTGGGGCCCAAG ATCGT CATCGTGAGC AAG ATG A 
HIFTVNKQLGPKIVIVSKMM 
2830 2850 2870 

TGAAGGACGTGTTCTTCTTCCTCTTCTTCCTCGGCGrGTGGCTGGTAGCCTATGGCGTGG 
KDVFFFLFFLGVWLVAYGVA 
2890 2910 2930 

CCACGGAGGGGOTCCTGAGGCCACGGGACAGTGACrrCCCAAGTATCCTGCGCCGCGTCT 
TEGLLRPRDSDFPS ILRRVF 
2950 2970 2990 

TCTACCGTCCCTACCTGCAGATCTTCGGGCAGATTCCCCAGGAGGACATGGACGTGGCCC 
YRPYLQI FGQIPQEDMDVAL 
3010 3030 3050 

TCATGGAGCACAGCAACTGCTCGTCGGAGCCCGGCrTCTGGGCACACCCTCCTGGGGCCC 
ME HSNCSSEPGFWAHPPGAQ 
3070 3090 3110 

AGGCGGGCACCTGCGTCTCCCAGTATGCCAACTGGCTGGTGGTGCTGCTCCTCGTCATCT 
AGTCVSQYANWLVVLLLVIF 
3130 3150 3170 

T CCTGCTCGTGGCC AACATCCTGCTGGT CAACTTGCTCATTG CC ATGTTC AGTTACACAT 
LLVANILLVNLLIAMFSYTF 
3190 3210 3230 

TCGGCAAAGTACAGGGCAACAGCGATCTCTACTGGAAGGCGCAGCGTTACCGCCTCATCC 
GKVQGNSDLYWKAQRYRLIR 
3250 3270 3290 

GGGAATTCCACTCTCGGCCCGCGCTGGCCCCGCCCrTTATCGTCATCTCCCACTrGCGCC 
EFH SR PALAPPFIVI SHLRL 
3310 3330 3350 

TCCTGCTCAGGCAATTGTGCAGGCGACCCCGGAGCCCCCAGCCGTCCTCCCCGGCCCTCG 
LLRQLCRRPRSPQPSS PALE 
3370 3390 3410 

AGCATTTCCGGGTTTACCTTTCTAAGGAAGCCGAGCGGAAGCTGCTAACGTGGGAATCGG 
HFRVY LSKEAERKLLTWESV 
3430 3450 3470 

TGCATAAGGAGAACTTTCTGCTGGCACGCGCTAGGGACAAGCGGGAGAGCGACTCCGAGC 
HKEN FLLARARDKRES DSER 
3490 3510 3530 

GTCTGAAGCGCACGTCCCAGAAGGTGGACTTGGCACTGAAACAGCTGGGACACATCCGCG 
LKRTSQKVDLALKQLGHIRE 
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3550 3570 3590 

AGTACGAACAGCGCCrGAAAGTGCTGGAGCGGGAGGTCCAGCAGTGTAGCCGCGrCCTGG 
YEQRLKVLEREVQQCSRVLG 

3610 3630 3650 

GGTGGGTGGCCGAGGOCCTGAGCCGCTCTGCCTTGCTGCCCCCAGGTGGGCCGCCACCCC 
V7VAEALSRSALLPPGG PPPP 

3670 3690 3710 

CTGACCTGC CTGGGTCCJUVAG ACTG AG CCCTGCT GGCGGACT T CAAGG AG AAGCCCCC AC 
DLPGSKD* 

3730 3750 3770 

AGGGGATTTTGCTCCTAGAGTAAGGCTCATCTGGGCCTCGGCCCCCGCACCTGGTGGCCT 

3790 3810 3830 

TGTCCTTGAGGTGAGCCCCATGTCCATCTGGGCCACTGTCAGGACCACCTrTGGGAGTGT 

3850 3870 3890 

(^TCCTTACAAACCACAGCATGCCCGGCTCCTCCCAGAACCAGTCCCAGCCrGGGAGGAT 

3910 3930 3950 

CAAGGCCTGGATCCCGGGCCGTTATCCATCTGGAGGCTGCAGGGTCCTTGGGGTAACAGG 

3970 3990 401O 

G ACCAC AG AC CCCTC AC CACT CACAGATT CC*T C AC ACTGG G G AAAT AAAGCXIATTT CAGA 

4030 

GGftAAAAAAAAAAAAAAAAAAA 



MVVPEKEQSWIPKIFKKKTCTTTFIVDSTDPGGTLCQC^ 

SN FLRLS DRT DPAAVYSLVTRTWG FRAPNLWS VLGGS GG PVLQTWLQDLLRRGLVRAAQSTGAW I VTGG LHTG IGRKVG VAV 
QMASTGGTKVVAMGVAPWGVTONRDTLINPKGSFPARYRWRGDPED^^ 

QKTGVGGTGIDIPVLLLLIDGDEKNLTRIEmTQAHVPCLLVAGSRGLGMPGGTLEAHLAQDGDHKANQSTNQIXLPKDLSI^ 

SIDRKTLQSYSERLAVAWNRVDIAQSELFIIGDIQWRSFHLEASLM^ 

LIRNLLDQASHSAGTKAPALKGGAAELRPPDVGHVIJRM^ 

FWSDLLLWAIiIiLNRAC34AMYFWEMG3tlAVSSAl^^ 

RRC PLWGDATCLQIJ^QADARAFFAQDGVQSLLTQKWWGDMAS TTPIWALVIiAFFCPPIilYTRLI T FRKSEEEPTREELE FDfc 
INGEGPVGTADPAEKTPIX^PRQSGRPGCCGGRCGGRRCLRRWFHFWGVPVTIFMGNWSYLLFLLLFSRVLLVDFQPAPPGS 
LLYFWAFTIiCEEI^QGLSGGGGSIASGGPGPGHASI^QRLRLYIJUDSVraQCDLVALTCT 

FTVRLLHIETVNKQLGPKIVIVSKbIMKDVFFFLFFLGVWLVAYGVATEGLLRPRDSDFPSILRRVFYRPYLQIEX3QIPQEDMC 
MEHSNCS SEPGFWAHPPGAQAGTCVS QYANWLWLLI.V I FLLVAN I LLVN LLIAMFS YTFGKVQGN SDLYWKAQRYRLIRE F¥ 
ALAPPFIVISHLRIJiJlQLCRRPRSPQPSSPAliEHFTlVYL^ 

KQLGHIREYEQRiKVIJ^VQQCSRVLGWVAEALSRSALLPPGGPPPPDLPGSKD 

i 

10 30 50 

ATCC^TGGCGGTCCTTCCATCTCGAAGCTTCCCr^TGGACGCCCTGCTGAATGACCGG 

70 90 110 

CCTGAGTTCGTGCGCTTGCTCATTTCCCACGGCCrCAGCCTGGGCCACTTCCTGACCCCG 

130 150 170 

ATGCGCCTGGCCCAACTCTACAGCGCGGCGCCCrCCAACTCGCTCATCCGCAACCT'TTTG 

190 210 230 

GACCAGGCGTCCCAC AGCGCAGG CACCAAAG CCCCAGCC CT AAAAGGGGG AG CTG CGGAG 

250 270 290 

CTCCGGCCCCCTGACGTGGGGCATGTGCTGAGGATGCTGCTGGGGAAGA'TGTGCGCGCCG 

310 330 350 

AGATGTAT CTGCT CT CG G ACAAG G CC ACCT CG CCGCTCTCGCTG GATGCTG GCCT CGGGC 
MYLLS DKATS PLSLDAGLGQ 
370 390 410 

AGGCCCCCTGGAGCGACCTGCTTCTTTGGGCACrSTTGCTGAACAGGGCACAGATGGCCA 
APWSDLLLWALLLNRAQMAM 
430 450 470 

TGTACTTCTGGGAGATGGGTTCCAATGCAGTTTCCTCAGCTCTTGGGGCCTGTTTGCTGC 
YFWEMGSNAVSSALGACLLL 
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490 510 530 

TCCGGGTGATGGCACGCCTGGAGCCTGACGCrGAGGAGGCAGCACGGaGGAAAGACCTGG 
RVMARLEPOAEEAARRKDLA 
550 570 590 

CGTTCAAGTTTGAGGGGATGGGCGTTGACCTCTTTGGCGAGTGCTATCGCAGCAGTGAGG 
FKFEGMGVD LFGECYRS SEV 
610 630 650 

TGAGGGCTGCCCGCCTCCTCCTCCGTCGCTGCCCGCTCTGGGGGGATGCCACTTGCCTCC 
RAARLLLRR CPLWGDATCLQ 
670 690 710 

AGCTGGCCATGCAAGCXGACGCCCGTGCCTTCTTTGCCCAGGATGGGGTACAGTCTCTGC 
LAMQADARAFFAQDGVQSLL 
730 750 770 

TGACACAGAAGTGGTGGGGAGATATGGCCAGCACTACACCCATCTGGGCCCTGGTTCTCG 
TQKWWGDMASTT PIWALVLA 
790 810 830 

CCTTCTTTTGCCCTCCACTCATCTACACCCGCCTCATCACCTTCAGGAAATCAGAAGAGG 
FFCPPLIYTRLITFRKSEEE 
850 870 890 

AGCCCACACGGGAGGAGCTAGAGTTTGACATGGATAGTGTCATTAATGGGGAAGGGCCTG 
PTREEiEFDMDSVINGEGPV 
910 930 950 

TCGGGACGGCGGACCCAGCCGAGAAGACGCCGCTGGGGGTCCCGCGCCAGTCGGGCCGTC 
GTADPAEKT PLGVPRQSGRP 
970 990 1010 

CGGGTTGCTGCGGGGGCCGCTGCGGGGGGCGCCGGTGCCTACGCCX3CTGGTTCCACTTCT 
GCCGGRCGGRRCLRRWFHFW 
1030 1050 1070 

GGGGCGTGCCGGTGACCATCTTCATGGGCAACGTGGTCAGCTACCTGCTGTTCCTGCTGC 
GVPVTIFMGNVVSYLLPLLL 
1090 1110 1130 

rTTTCTCGCGGGTGCTGCTCGTGGATTTCCAGCCGGC^CCGCCCGGCTCCCTGGAGCTGC 
FSRVLLVDFQPAPPGSLELL 
1150 1170 1190 

TGCTCTATTTCTGGGCTTTCACGCTGCTGTGCGAGGAACTGCGCCAGGGCCTGAGCGGAG 
X»YFWAFTLI*CEELRQGLSGG 
1210 1230 1250 

GCGGGGGCAGCCTOGCCAGCGGGGGCCCCGGGCCTGGCCATGCCTCACTGAGCCAGCGCC 
GGS LASGGPGPGHASLSQRL 
1270 1290 1310 

TGCGCCTCTACCTCGCCGACAGCTGGAACCAGTGCGACCTAGTGGCTCTCACCTGCTTCC 
RLYLADSWNQCDLVALTCFL 
1330 1350 1370 

TCCTGGGCGTGGGCTGCCGGCTGACCCCGGGTTTGTACCACCTGGGCCGCACTGTCCTCT 
LGVGCRLTPGLYHLGRTVLC 
1390 1410 1430 

GCATCGACTTCATGGTTTTCACGGTGCGGCTGCTTCACATCTTCACGGrCAACAAACAGC 
IDFMVFTVRLLHIFTVNKQL 
1450 1470 1490 

TGGGGCCCAAGATCGTCATCGTGAGCAAGATGATGAAGGACGTGTTCTTCTTCCTCTTCT 
GPKIVIVSKMMKDVFFFLFF 
1510 1530 1550 

TCCTCGGCGTGTGGCTGGTAGCCTATGGCGTGGCCACGGAGGGGCTCCTGAGGCCACGGG 
LGVWLVAYGVATEGLLRPRD 

1570 1590 1610 

ACAGTGACTTCCCAAGTATCCTGCGCCGCGTCTTCTACCGTCCCTACCTGCAGATCTTCG 
SDFPSILRRVFYRPYLQI F G 
1630 1650 1670 

GGCAGATTCCCCAGGAGGACATGGACGTGGCCCTCATGGAGCACAGCAACTGCTCGTCGG 
QIPQEDMDVALMEHSNCSSE 

1690 1710 1730 

AGCCCGGCTTCTGGGCACACCCTCCTGGGGCCCAGGCGGGCACCTGCGTCTCCCAGTATG 
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PGFWAHPPGAQAGTCVSQYA 
1750 1770 1790 

CCAACTGGCTGGTGGTGCTGCTCCTCGTCATCTTCCTGCTCGTGGCCAACATCCTGCTGG 
NWLVVLLLVIFLLVANILLV 
1810 1830 1850 

TCAACTTGCTCATTGCCATGTTCAGTTACACATTCGGCAAAGTACAGGGCAACAGCGATC 
NLLIAMFSYTFGKVQGNSDL 
1870 1890 1910 

TCTACTGGAAGGCGCAGCGTTACCGCCTCATCCGGGAATTCCACTCTCGGCCCGCGCTGG 
YWKAQRYRLIREFHSRPALA 
1930 1950 1970 

CCCCGCCCTTTATCGTCATCTCCCACTTGCGCCTCCTGCTCAGGCAATTGTGCAGGCGAC 
PPFIVI S-HLRLLLRQLCR.RP 
1990 2010 2030 

CCCGGAGCCCCCAQCCGTCCTCCCCGGCCCTCGAGCATTTCCGGGTTTACCTTTCTAAGG 
RSPQPS SPALEHFRVYLSKE 
2050 2070 2090 

AAGCCGAGCGGAAGCTGCTAACGTGGGAATCGGTGCATAAGGAGAACTTTCTGCTGGCAC 
AERKLLTWESVHKENFLLA. R 
2110 2130 2150 

G C GCT AGGG ACAAGCGGGAGAGCG ACTCCGAGCGTCT GAAGCGCACG T CCCAGAAGGTGG 
ARDKRE SDSERLKRTSQKVD 
2170 2190 2210 

ACTT GGCACT GAAACAGCTGGGACACATCCGCG AGTACGAAC AGCGCCT GAAAGTGCTGG 
LALKQLGHIREYEQRLKVLE 
2230 2250 2270 

AGCGGGAGGTCCAGCAGTGTAGCCGCGTCCTGGGGTGGGTGGCCGAGGCCCTGAGCCGCT 
REVQQC SRVLGWVAEALSRS 
2290 2310 2330 

CTGCCTTGCTGCCCCCAGGTGGGCCGCCACCCCCTGACCTGCCTGGGTCCA^GACTGAG 
ALLFPGGPPPPDLPGSKD* 
2350 2370 2390 

CCCTGCTGGCGGACTTCAAGGAGAAGCCCCCACAGGGGATTTTGCTCCTAGAGTAAGGCT 

2410 2430 2450 

CATCTGGGCCTCGGCCCCCGCACCTGGTGGCCTTGTCCTTGAGGTGAGCCCCATGTCCAT 

2470 2490 2510 

CTGGGCCACTGTCAGGACCACCTTTGGGAGTGTCATCCTTACAAACCACAGCATGCCCGG 

2530 2550 2570 

CTCCTCCCAGAACCAGTCCCAGCCTGGGAGGATCAAGGCCTGGATCCCGGGCCGTTATCC 

2590 2610 2630 

ATCT GG AG GCTGCAGGGT CCTT G GGGT AAC AGGGACCACAGACCC CT CACCACTCAC AGA 

2650 2670 2690 

TT CCTCAC ACTGGGG AAATAAAGCCATTT C AGAGGAAAAAAAAAAAAAAAAAAAA 



MYLLS DKAT S PLS LDAGLGQAPW S DLLLWALLLNRAQMAMYFWEMGSNAVS S ALGACLLLRVMARLEP DAEEAARRKDLAFKFEGM 

GVDLFGECYRSSEVRAARLLLRRCPLWGDATCLQLAMQADARAFFAQDGVQ^^ 

TFRKSEEEPTREELEFDMDSVINGEGPVGTADPAEKTPLGVPRQSGRPGCOT 

LLFSRVLLVDFQPAPPGSIJSLLLYFWAFTIjLCEELRQGL^ 

CRLTPGLYHLGRTVLCIDFMVFTVRLLHIFTVNKQLGPKIVIV^ 

FYRPYLQI FGQI PQEDMDVAI^EHSNCSSEPGFIflAHPPGAQAGTCVSQYANWLWIiLLVIFliLVANILLVNLLIAMFS YT FGKVQG 
N S D L YWKAQR YRLI RE FHSRPALA PPFIVI SHLRLLLRQ LCRRPRS PQ P S S PALE H FRVYL SKE AERKLLTWE S VHKEN FLLARAR 
DKRESDSERLKRTSQKVDLALKQLGHIREYEQRLKVLEREVQQCSRVLGWVAEALSRSALLPPGGPPPPDLPGSKD 
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Fig. 10 



10 30 50 

ATTAAAGTTTATAAAACAGTGGCTGGATGGTTGGAGGATGCAGGTGGACAGAAGACGTGG 

MVGGCRWTEDVE 
70 90 110 

AGCCTG<^GAAGTAAAGGAAAAGATGTCCTTTCGGGCAGCCAGGCTCAGCATGAGGAACA 
PAEVKEKMSFRAARLSNRNR 
130 150 170 

GAAGGAATGACACTCTGGACAGCACCCGGACCCTGTACTCCAGCGCGTCTCGGAGCACAG 
RNDTLDSTRTLYSSASRSTD 
190 210 230 

ACTTGTCTTACAGTGAAAGCGCCAGCTTCTACGCTGCCTTC^GGACACAGACGTGCCCAA 
LSYSESAS FYAAFRTQTC P I 
250 270 290 

TCATGGCTTCTTGGGACTTGGTGAATTTTATTCAAGCAAATTTTAAGAAACGAGAATGTG 
MASWDLVN FIQANFKKRECV 
310 ' 330 350 

TCTTCTTTACCAAAGATTCCAAGGCCACGGAGAATGTGTGCAAGTGTGGCTATGCCCAGA 
FFTKDSKATENVCKCGYAQS 
370 390 410 

GCCAGC ACATGG AAGGGAC CC AG AT CAACCAAAGTGAGAAATGG AACTAC AAG AAAC ACA 
QHMEGTQINQSEKWNYKKHT 
430 450 470 

CCAAGGAATTTCCTACCGACGCCTTTGGGGATATTCAGTTTGAGACACTGGGGAAGAAAG 
KE FPTDAFGD IQFETLGKK'G 
490 510 530 

GGAAGTATATACGTCTGTCCTGCGACACGGACGCGGAAATCCTTTACGAGCTGCTGACCC 
KYIRLSCDTDAEILYELLTQ 
550 570 590 

AGCACT GGCACCTGAAAAC ACCCAACCTGGT CATTT CTGTG ACCGGGGGCGCCAAG AACT 
HWHLKTPNLV ISVTGGAKNF 
610 630 650 

TCGCCCTGAAGCCGCGC ATG CGCAAGATCTT CAGCCGG CTC ATCTACAT CGCGCAGT CCA 
ALKPRMRKIFSRLIYIAQSK 
670 690 110 

AAGGTG CTTGGATTCTC ACG GGAG GC ACCCAT TATGGCCTGATGAAGT AC ATCGGG G AGG 
GAWILTGGTHYGLMKYIGEV 
730 750 "770 

T GGTG AG AjGAT AACACC AT CAGC AGG AGTTCAG AGG AGAAT ATTGTGG C C ATTGG C AT AG 
VRDNTISRSSEENIVAIGIA 

790 810 830 

CAGCTTGGGGCATGGTCTCCAACCGGGACACCCTCATCAGGAATTGCGATGCTGAGGGCT 
AWGMVSNRDT LIRNCDAEGY 
850 870 890 

ATTTTT TAGCCCAGT AC CT T ATGG AT G ACTT CAC AAGAG ATCC ACTGTAT ATCCT GG AC A 
FLAQYLMDDFTRDPLYILDN 
910 930 950 

ACAACC AC AC ACATTT GCT G CT CGT GGACAAT GG CTGTCATG G ACAT C C C ACTGT CG AAG 
NHTHLLLV DNGCHGHPTVEA 
970 990 1010 

C AAAG CTCCGG AATC AGCT AGAG AAGTAT AT CT CTGAGCGC ACT ATT C AAGATT CCAACT 
KLRNQLEKYI SERTIQDSNY 
1030 1050 1070 

ATGGTGGCAAGATCCCCATTGTGTGTTTTGCCCAAGGAGGTGGAAAAGAGACTTTGAAAG 
GGKIPIV v C FAQGGGKETLKA 
1090 1110 H30 

CCATCAATACCTCCATCAAAAATAAAATTCCTrGTGTGGTGGTGGAAGGCTCGGGCCAGA 
I NTSIKNKIPCVVVEGSGQI 
1150 1170 H90 

TCGCTGATGTGATCGCTAGCCTGGTGGAGGTGGAGGATGCCCTGACATCTTCTGCCGTCA 
ADVIASLVEVEDALTSSAVK 
1210 1230 1250 
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AGGAGAAGCTGGTGCGCTTTTTACCCCGCACGGTGTCCCGGCTGCCTGAGGAGGAGACTG 
EKLVRFLPRTVSRLPEEETE 
1270 1290 1310 

AGAGTTGGATGAAATGGCTCAAAGAAATTCTCGAATGTTCTCACCTATTAACAGTTATTA 
SWIKWLKEX LECSHLLTV I K 
1330 1350 1370 

AAATGGAAG AAGCT GGGG AT G AAATTGTG AGC AAT GCC AT CTCCT ACG CTCT ATAC AAAG 
MEEAGDEIV SWAIS Y A L Y K A 
1390 1410 1430 

CCTTCAGCACCAGTGAGCAAGACAAGGATAACTGGAATGGGCAGCTGAAGCTTCTGCTGG 
FSTSEQDKDNWNGQLKLLLE 
1450 1470 1490 

AGTGGAACCAGCTGGACTTAGCCAATGATGAGATTTTCACCAATGACCGCCGATGGGAGA 
WNQLDLANDEIFTNDRRWEK 
1510 1530 1550 

AG AG CAAACCG AGGCT CAGAGAC ACAATAATCCAGGTCACATGGCTGG AAAAT GGT AGAA 
SKPRLRDTI IQVTWLENGRI 
1570 1590 1610 

TCAAGGTTGAGAGCAAAGATGTGACTGACGG CAAAGCCTCTT CTCATATG CTGGT GGTT C 
KVE S KDVTDGKASS HMLVVL 
1630 1650 1670 

TCAAGTCTGCTG ACCTT C AAGAAGT CATGTTT ACGGCTCTC ATAAAGG ACAGACCCAAGT 
KSADLQEVMFT.ALI KDR P K F 
1690 1710 1730 

TTGTCCGCCTCTTTCTGGAGAATGGCTTGAACCTACGGAAGTTTCTCACCCATGATGTCC 
VRLFLENGLNLRKFLTHDVL 
1750 1770 1790 

T CACTGAACT CT TCT C CAACC ACTT CAGC ACGCTTGTGTACCGG AATCTGC AG AT CGCC A 
TELFSNHFSTLVYRNLQIAK 
1810 1830 1850 

AGAATTCCTATAATGATGCCCTCCTCACGTTTGTCTGGAAACTGGTTGCGAACTTCCGAA 
NSYNDALLTFVWKLVAN FRR 
1870 1890 1910 

G AGGCTT CCGGAAGG AAG AC AG AAATGGCCGGGACGAGATGG ACAT AG AACT CC ACGACG 
GFRKEDRNGRDEMDIELHDV 
1930 1950 1970 

TGTCTCCTATTACTCGGCACCCCCTGCAAGCrCTCTTCATCTGGGCCATOCTTCAGAATA 
SP ITRHPLQALFIWAILQNK 
1990 2010 2030 

AGAAGGAACTCTCCAAAGTCATTTGGGAGCAGACCAGGGGCTGCACTCTGGCAGCCCTGG 
KELSKVIWEQTRGCTLAALG 
2050 2070 2090 

GAGCCAGCAAGCTT CTG AAGACT CTGGCCAAAGTGAAGAACG ACATCAATGCT GCTGGGG 
A S K L LKTLAKVKN D INAAGE 
2110 2130 2150 

AGTCCGAGGAGCTGGCTAATGAGTACGAGACCOGGGCTGTTGGTGAGTCCACAGTGTGGA 
SEELANEYETRAVG ESTVWN 
2170 2190 2210 

ATGCTGTGGTGGGCGCGGATCTGCCATGTGGCACAGACATTGCCAGCGGCACTCATAGAC 
AVVGADLPCGTDIASGTHR P 
2230 2250 2270 

CAGATGGTGGAGAGCTGTTCACTGAGTGTTACAGCAGCGATGAAGACTTGGCAGAACAGC 
DGGELFTECYSSDE DLAEQL 
2290 2310 2330 

TGCTGGTCT ATT CCTGTGAAGCTTGGGGTGG AAG C AACT GTCTGGAGCTGGCGGTGGAGG 
LVYSCEAWGGSNCLELAVEA 
2350 2370 2390 

CCACAGACCAGCATTTCATCGCCGAGCCTGGGGTCCAGAATTTTCTTTCTAAGCAATGGT 
TDQHFIAQPGVQNFLSKQWY 
2410 2430 2450 

ATGGAGAGATTTCCCGAGACACCAAGAACTGGAAGATTATCCTGTGTCTGTTTATTATAC 
GEISRDTKNWKIILCLFIIP 
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2470 2490 2510 

CCTTGGTGGGCTGTGGCTTTGTATCATTTAGGAAGAAACCTGTCGACAAGCACAAGAAGC 
LVGCGFVSFRKKPVDKHKKL 

2530 2550 2570 

TGCTTTGGTACTATGTGGCGTTCTTCACCTCCCCCTTCGTGGTCTTCTCCTGGAATGTGG 
LWYYVAFFTSPFVVFSWNVV 
2590 2610 2630 

TCTTCTACATCGCCTTCCTCCTGCTGTTTGCCTACGTGCTGCTCATGGATTTCCATTCGG 
FYIAFLLLFAYVLLMDFHSV 
2650 2670 2690 

TGCCACACCCCCCCGAGCTGGTCCTGTACTCGCTGGTCTTTGTCCTCTTCTGTGATGAAG 
PHPPELVLYSLVFVLFCDEV 
2710 2730 2750 

TGAGACAGGGCCGGCCGGCTGCTCCCAGTGCGGGGCCCGCCAAGCCCACGCCCACCCGGA 
RQG RPAAPSAGPAKPT PTRN 
2770 2790 2810 

ACTCCATCTGGCC^CTAGCTCCACACGCAGCCCCGGTTCCCGCTC^CGCa^CTCCTTCC 
S IW PASSTRS PGSRSRHS FH 
2830 - . 2850 2870 

ACACTTCCCrTGCAAGCTGAGGGTGCC^GCTCTGGCCTTGGCCAGCCCAGAAAGGGGTGGA 
TSLQAEGASSGLGQPRKGWT 

2890 2910 2930 

CATTTAAAAATCTGGAAATGGTTGATATTTCCAAGCTGCTGATGTCCCTCTCTGTCCCTT 
FKNLBMVDISKLLMSLSVPF 
2950 2970 2990 

rCTGTACGCAGTGGTACGTAAATGGGGTGAATTATTTTACTGACCTGTGGAATGTGATGG 
CTQWYVNGVNYFTDLWNVMD 
3010 3030 3050 

ACACGCTGGGGCTTTTTTACTTCATAGCAGGAATTGTATTTCGGCAAGGGATCCTTAGGC 
TLGLFYFIAGIVFRQGILRQ 
3070 3090 3110 

AGAATG AGCAGCGCTGGAGGT GGATATTCCGTT CGGTCATCT ACGAGCCCTACCT GG CCA 
NEQRWRWI FRSVIYE PYLAM 
3130 3150 3170 

TGTTCGGCCAGGTGCCCAGTGACGTGGATGGTACCACGTATGACTTTGCCCACTGCACCT 
FGQVPSDVDGTTYDFAHCTF 
3190 3210 3230 

TCACT GGGAATGAGT CCAAGCCACTGTGTGTGGAGCTGGATGAGCACAACCTGCCCCGGT 
TGNESKPLCVELDEHNLPRF 
3250 3270 3290 

TCC CCGAGT G GAT CACCATCCCCCTGGTGT G C ATCTACATGTT ATCC ACCAAC ATCCTGC 
PEWITIPLVCIYMLSTNILL 
3310 3330 3350 

TGGTCAACCTGCTGGTCGCCATGTTTGGCTACACGGTGGGCACCGTCCAGGAGAACAATG 
VNLLVAMFGYTVGTVQENND 
3370 3390 3410 

ACCAGGTCTGGAAGTTCCAGAGGTACTTCCTGGTGCAGGAGTACTGCAGCCGCCTCAATA 
QVWKFQRYFI.VQEYCSRLNI 
3430 3450 3470 

TCCCCTTCCCCTTCATCGTCTTCGCTTACTTCTACATGGTGGTGAAGAAGTGCTTCAAGT 
PFPFIVFAYFYMVVKKCFKC 
3490 3510 3530 

GTTGCTGCAAGGAGAAAAACATGGAGTCTTCTGTCTGCTGTGAGTGGTTTATCCATGTGT 
CCKEKNMES SVCCEV7 FIHVY 
3550 3570 3590 

ACTTGGGATC AG AAG CAGCG ATT AATTT C AGGGAAG G ATGCCT GC ATCCAGT G ATTGGAA 
LGSEAAINFREGCLHPVIGS 

3610 3630 3650 

GCTGGACCCCAGGCTGGCTGGTCTGGACATCCACACGCATTCTCACATGCAGTGCCGGCT 
WTPGWLVWT STRILTCSAGW 

3670 3690 3710 

GGCCAGCAGCAGGGAGTCTCAGTGTCACCACACATAGCAGCTGGGTTCCTGCAAAAAGCA 
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PAAGSLSVTTHSSWVPAKSS 
3730 3750 3770 

GCAAGTCACAGGCCCACCCAGACAGAACGGGTAGAGAATGTGACTCTGCTTCTGGGTGGG 
KSQAHPDRTGRECDSASGWE 
3790 3810 3830 

AAGGACAGCCTGCCCGGTGGGTGGAAGAATCCGT'SGCCCTGTTTGGCCATCG'TGGCCCTG 
GQPARWVEE SVALFGHRGPV 
3850 3870 3890 

TTTGGCCACCTACCACTCTAGGCATCACTGAGC'rGAATGCGCCGGTCCTCTGA 
WPPTTLGITELNAPVL* 



MVGGC^WTEDVEPAEVKEKMSFRAARLSMRNRRKDTLDST^ 
KKRECVFFTKDSKATENVCKCGYAQSQHKQ^ 

HWHIOTPNLVISVTGGAKNFALKPRMWCIFSRLIYIAQSKGAWILTGGTHYGLMKYIGEWRDN 

NRDTLIRNCDAEGYFLAQYLMDDr/RDPLYILDNNHTH LLLVDNGCHGHPT VEAKLRNQI.EKYI SERTI QDSNYGGKI PIVCFAQG 
GGKETLKAINTS I KNKI PC WVE G S G QIADV I AS LVEVE DALTSS AVKEKLVRFLPRTVSR.L PEEETE S WXKWLKE I LECSHLLTV 
IKMEEAGDE I VSNAI S YALYKAFST SEQDKDNWNGQLKLLLEWNQLDLANDE I FTNDRRWEKS KPRLRDT I IQVTW LEN GRIKVE S , 
KDVTDGTOSSHMLWLKSADI^EVMHTALIKDRPKFVRLF 

VWKLV AN FRRG FRKE DRNGRDEM D I ELH DVS PIT RH PI»Q ALFTWAIIiQN KKE LS KV IWEQTRG CT LAALGASKLLKT LAKVKN DIN 
AAGESEEl^EYETRAVCTSTWNAWGADLPCGTDIASGTH^ 

QHFIAQPGVQNFLSKQWYGEI SRDTKNWKI ILCLFI I PL.VGCGFVS FRKKPVDKHKKLLWY YVAFFT S PFWFSWNWFYIAFLLL 

FAYVlXt^FH^PHPPELVLYSI.VFVLFCDEVRQGRPAAPSAGPAKP^ 

PRKGWFKNLEMVDISKLIiMSLSVPFCTC^YVNGVNYFTDLWNV^^ 

FGQVPSDVDGTT YDFAHCTFTGNESKPLCVELDEHNLPRFPEWITI FLVCI YHLSTN ILLVNLLVAM FG YTVGTVQENHDQVWKFQ 
RYFLVQEYCSRLNI P FPFI VFAYFmWKKCFKCCCKEKNMESSVCCEWFI HVYLGSEAAIN FREGCLHPVIGSWT PGWLVWTSTR 
ILTCSAGWPAAGSLSVTTHSSWPAKSSKSC^PDRTGRECDS 



Q L 

2290 2310 2330 

TGCTGCTCTATTCCTGTGAAGCTTGGGGTGGAAGCAACTGTCTGGAGCTGGCGGTGGAGG 
L V Y S CEAWG GSN CLELAVEA 
2350 2370 2390 

C CACAG ACCAGCATT T CATCG CC CA.GC CrGG^GfTCCAGAATTT T CTTTCT AAG C AATGGT 
TDQHFIAQPGVQNFLSKQWY 
2410 2430 2450 

ATGGAGAGATTTCCCGAGACACCAAGAACTGGAAGATTATCCTGTGTCTGTTTATTATAC 
GEI SRDTKNWKI ILCLFIIP 
2470 2490 2510 

CCTTGGTG^GCTGTGGCrTTGTATCATTTAGGAAGAAACCTGTCGACAAGCACAAGAAGC 
LVGCGFVS FRKK P VDK 
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Figure 11: 

a.) TrplOb cDNA and derived amino acid sequence 



10 30 50 

ATGAAATCCTTCCTTCCTGTCCACACCATCGTGCTTATCAGGGAGAATGTGTGCAAGTGT 
MKSFIiPVHTIVLIRENVCKC 

70 90 110 

GGCTATGC CCAG AGC CAGCAC ATGG AAG GCACC CAG ATCAACCAAAGTG AG AAATGG AAC 
GYAQSQHMEGTQINQSEKWN 

130 150 170 

TACAAGAAACACACCAAGGAATTTCCTACCGACGCCTTTGGGGATATTCAGTTTGAGACA 
Y KKHTKE F PTDAFGDIQFET 

190 210 230 

CTGGGGAAGAAAGGGAAGTATATACGTCTGTCCTGCGACACGGACGCGGAAATCCTTTAC 
L GKKGKY I RL S CDTDAE I" L Y 

250 270 290 

GAGCTGCTGACCCAGCACTGGCACCTGAAAACACCCAACCTGGTCATTTCTGTGACCGGG 
ELLTQHWHbKTPNLVISVTG 

310 330 350 

GGCGGCAAGAACTTCGCCrCTGAAGCCGCGCATGCGCAAGATCTTCAGCCGGCTCATCTAC 
GAKNFALKPRMRKIFSRL IY 

370 390 410 

ATCGCGCAGTCCAAAGGTGCTTGGATTCTCACGGGAGGCACCCATTATGGCCTGATGAAG 
1 AQS KGAW I LTGGTHYGLMK 

430 450 470 

TACATCGGGGAGGTGGrGAGAGATAACACCATCAGCAGGAGTTCAGAGGAGAATATTGTG 
YIGEVVRDNTISRSSEENIV 

490 510 530 

GCCATTGGCATAGCAGCTTGGGGCATGGTCTCCAACCGGGACACCCTCATCAGGAATTGC 
AIGIAAWGMVSNRDTLIRNC 

550 570 590 

GATGCTGAGGGCTATTTTTTAGCCCAGTACCTTATGGATGACTTCACAAGAGATCCACTG 
DAEGYFLAQYIiMDDFTRDPL 

610 630 650 

TATATCCTGGACAACAACCACACACATTTGCTGCTCGTGGACAATGGCTGTCATGGACAT 
YILDNNHTHLLLVDNGCHGH 

670 690 710 

CCCACTGTCGAAGCAAAGCTCCGGAATCAGCTAGAGAAGTATATCTCTGAGCGCACTATT 
PTVEAKLRNQLEKYISERTI 

730 750 770 

CAAGATTCCAACTATGGTGGCAAGATCCCCATTGTGTGTTTTGCCCAAGGAGGTGGAAAA 
QDSNYGGKIPIVCFAQGGGK 

790 810 830 

GAGACTTTGAAAGCCATCAATACCTCCATCAAAAATAAAATTCCTTGTGTGGTGGTGGAA 
ETLKAINTSIKNK IPCVVVE 

850 B70 890 

GGCTCGGGCCAGATCGCTGATGTGATCGCTAGCCTGGTGGAGGTGGAGGATGCCCTGACA 
GSGQIADVIASLVEVEDALT 

910 930 950 

TCTTCTGCCGTCAAGGAGAAGCTGGTGCGCTTTTTACCCCGCACGGTGTCCCGGCTGCCT 
S SAVKEKLVRFL PRTVS RLP 

970 990 1010 

GAGGAGGAGACTGAGAGTTGGATCAAATGGCTCAAAGAAATTCTCGAATGTTCTCACCTA 
EEETESWI KWIjKEILiECSHL 

1030 1050 1070 

TTAACAGTTATTAAAATGGAAGAAGCTGGGGATGAAATTGTGAGCAATGCCATCTCCTAC 
LTVIKMEEAGDEIVSNAISY 

1090 1110 1130 

GCTCTATACAAAGCCTTCAGCACCAGTGAGCAAGACAAGGATAACTGGAATGGGCAGCTG 
ALYKAFSTSEQDKDNWNGQL 
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Fig. 11 (Continuation) 



2410 2430 2450 

AG AAACTT AGGAC CC AAGATTAT AATG CTG C AG AGGATG C TG ATCG ATGTGTTCTTC TTC 
RNLGPKIIMLQRMLIDVFFF 

2470 2490 2510 

CTGTTCCTCTTTGCGGTGTGGATGGTGGCCTTTGGCGTGGCCAGGCAAGGGATCCTTAGG 
LFLFAVWMVAFGVARQGI Ii R 

2530 2550 2570 

C AGAATGAGCAG CGCTGGAGGTGGATATTC CGTTCGGTC ATCTACGAGCCCTAC CTGG CC 
QNEQRWRWI FRSVIYEPYLA 

2590 2610 2530 

ATGTTCGGCCAGGTGCCCAGTGACGTGGATGGTACCACGTATGACTTTGCCCACTGCACC 
MFGQVPSDVDGTTYDFAHCT 

2650 2670 2690 

TTCACTGGGAATGAGTCCT^GCCACTGTGTGTGGAGCTGGATGAGCACAACCTGCCCCGG 
FTGNESKPLCVELDEHNLPR 

2710 2730 2750 

TTCC C CG AGTGG ATCAC CATC C CC CTGGTGTGCATCTACATGTTAT C C ACC AAC AT CCTG 
FPEWITIPLVCIYM LSTNI L 

2770 2790 2B10 

CTGGTCAACCTGCTGGTCGCCATGTTTGGCTACACGGTGGGCACCGTCCAGGAGAACAAT 
LVNLLVAMFGYTVGTVQENN 

2830 2850 2870 

GACCAGGTCTGGAAGTTCCAGAGGTACTTCCTGGTGCAGGAGTACTGCAGCCGCCTCAAT 
DQVWKFQRYFLVQEYCSR LN 

2890 2910 2930 

ATCCCCTTCCCCTTCATCXBTCTTCGCTTACTTCTACATGGTGGTGAAGAAGTGCTTCAAG 
IPFPFIVFAYFYMVVKKCFK 

2950 2970 2990 

TGTTGOTGCAAGGAGAAAAACATGGAGTCTTCTGTCTGCTGTTTCAAAAATGAAGACAAT 
CCCKEKNME SSVCCFKNEDN 

3010 3030 3050 

GAGAC TCTGGC ATGGGAGGGTGTCATG AAGG AAAACTAC CTTGTC AAGATC AACACAAAA 
ETLAWEGVMKENYLVKINTK 

3070 3O90 3110 

G CC AACGAC AC CTCA.G AGG AAATGAGGCATCGATTTAGAC AACTGG ATACAAAG CTT AAT 
ANDTSEEMRHRFRQLDTKLN 

3130 3150 
GATCTCAAGGGTCTACTGAAAGAGATTGCTAATAAAATCAAATAG 
DLKGLLKEIANKIK* 



b.) TrplO protein: 

MKSFLPVHTIVLIRENVCKCGYAQSQHMEGTQINQSEKWNYKKHTKEFPTDAFGDXQFETLGKKGKYIRLSCDTDAEIL^ 

ELLTQHWHLKTPNLVISVTGGAKNFALKPRMRKI^ 

AIGIAAWGMVSNRDTLIRNCDAEGYFI^QYLTODFTF^ 

QDSNYGGKIPIVCFAQGGGKETLKAIOTSIKNKIPCVWEGSGQIAPVIASIiVEVEDALTSSAVKEK^ 
EEETESWIKWLKEILECSHLLTVIKMEEAGDEIVSNAIS 
RRWESADLQEVMFTALIKDRPKFVRLFIjENGLNIiRKFLTHDVLTE 
FRRGFRKEDRNGRDEMDIELHDVSPITRHPLQAL^ 

AGESEELANEYETRAVELFTECYSSDEDLAEQLLVYSCEAWGGSNCLEIiAVEATDQHFIAQPGVQNFLSKQWYGEISRDT 
KNWKI ILCLFI I PLVGCGFVS FRKKPVDKHKKLIjW YYVAFFTS PFWFS WNWFYIAFLLLFAYVLLMDFHSVPHPPELV 
LYSLVFVLFCDEVRQWYVNGVNYFTDLVWV^ 
RNLGPKIIMLQRMLIDVFFFLFLFAVWMVAFG 

FTGNE S KPL C VE LD EHNli PR F PEW I T I PL VC I YML S TN I IiLi VNIjIj VAM FG YT VGTVQ ENNDQ VW KFQR YF L VQE YC S RLN 

IPFPFIVPAYFYMVVKKCFKCCCKEKfc^ESSVCCFKNED^ 

DLKGLLKEIANKIK 
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The Trp8 Gene is expre a in endometrial or uteri 
in normal endometrium 

Endometrial cancer: 
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Expression of human Trp 9 ana b rp 1 u j 
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Fig. 14 

Expression of Trpl 0 transcripts and Trpl O-antisense Transcripts 
in human prostate cancer and in malignant melanoma 
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Trp8, Trp9 and TrplO, novel markers for cancer 



FIELD OF THE INVENTION 

The present invention relates to gene expression in normal cells and cells of malignant 
tumors and particularly to novel markers associated with cancer, TrpS, Trp9 and Tip 10, and 
the genes encoding Trp8, Trp9 and TrplO 

B ACKROUND OF THE TECHNOLOGY 

Prostate cancer is one of the most common diseases of older men world wide. Diagnosis and 
monitoring of prostate cancer is difficult because of the heterogeneity of the disease. For 
diagnosis different grades of malignancy can be distinguished according to the Gleason-Score 
Diagnosis. For this diagnosis a prostate tissue sample is taken from the patient by biopsy and 
the morphology of the tissue is investigated. However, this approach only yields subjective 
results depending on the experience of the pathologist. For confirmation of these results and 
for obtaining an early diagnosis an additional diagnostic method can be applied which is 
based on the detection of a prostate specific antigen (PSA). PSA is assayed in serum samples, 
blood samples etc. using an anti-PSA-antibody. However, since in principle PSA is also 
expressed in normal prostate tissue there is a requirement for the definition of a threshold 
value (about 4 ng/ml PSA) in order to be able to distinguish between normal and malign 
prostate tissue. Unfortunately, this diagnostic method is quite insensitive and often yields 
false-positive results. Moreover, by using this diagnostic method any conclusions as regards 
the grade of malignancy, the progression of the tumor and its potential for metastasizing 
cannot be drawn. Thus, the use of molecular markers would be helpful to distinguish benign 
from malign tissue and for grading and staging prostate carcinoma, particularly for patients 
with metastasizing prostate cancer having a very bad prognosis. 

The above discussed limitations and failings of the prior art to provide meaningful specific 
markers which correlate with the presence of prostate tumors, in particular metastasizing 
tumors, has created a need for markers which can be used diagnostically, prognostically and 
therapeutically over the course of this disease. The present invention fulfils such a need by the 
provision of Tpr8, Trp9 and TrplO and the genes encoding Trp8, Trp9 and TrplO: The genes 
encoding Trp8 and TrplO are expressed in prostate carcinoma and prostatic metastasis, but 
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not in normal prostate, benign hyperplasia (BHP) and intraepithelial prostatic neoplasia (PIN). 
Furthermore, expression of Tip 10 transcripts is detectable in carcinoma but not in healthy 
tissue of the lung, the prostate, the placenta and in melanoma. 

SUMMARY OF THE INVENTION 

The present invention is based on the isolation of genes encoding novel markers associated 
witha cancer, Trp8, Trp9 and TrplO. The new calcium channel proteins Trp8, Trp9 and TrplO 
are members of the trp (transient receptor potential) - family, isolated from human placenta 
(Trp8a and Trp8b) and humane prostate (Trp9, Trp 10a and Trp 10b). Trp proteins belong to a 
steadily growing family of Ca 2+ selective and non selective ion channels. In the recent years 
seven Trp proteins (trpl -trp7) have been identified and suggested to be involved in cation 
entry, receptor operated calcium entry and pheromone sensory signaling. Structurally related 
to the trp proteins are the vanilloid receptor (VR1) and the vanilloid like receptor (VRL-1) 
both involved in nociception triggered by heat. Furthermore, two calcium permeable channels 
were identified in rat small intestine (CaTl) and rabbit kidney (ECaC). These distantly related 
channels are suggested to be involved in the uptake of calcium ions from the lumen of the 
small intestine (CaTl) or in the reuptake of calcium ions in the distal tubule of the kidney 
(ECaC). Common features or the Trp and related channels are a proposed structure 
comprising six transmembrane domains including several conserved amino acid motifs. In the 
present invention the cloning and expression of a CaTl like calcium channel (Trp8) from 
human placenta as well as Trp9 and TrplO (two variants, Trp 10a and Trp 10b) is described. 
Two polymorphic variants of the Trp8 cDNA were isolated from placenta (TrpSa and TrpSb). 
Transient expression of the Trp8b cDNA in HEK (human embryonic kidney) cells results in 
cytosolic calcium overload implicating that the Trp8 channel is constitutive open in the 
expression system. Trp8 induces highly calcium selective inward currents in HEK cells. The 
C -terminus of the TrpS protein binds calmodulin in a calcium dependent manner. The Trp9 
channel is expressed in trophoblasts and syncytiotrophoblasts of placenta and in pancreatic 
acinar cells. Furthermore, the Trp8 channel is expressed in prostatic carcinoma and prostatic 
metastases, but not in normal tissue of the prostate. No expression of Trp8 transcripts is 
detectable in benign prostatic hyperplasia (BPH) or prostatic intraepithelial neoplasia (PIN). 
Therefore, the Trp8 channel is exclusively expressed in malign prostatic tissues and serves as 
molecular marker for prostate cancer. From the experimental results it is also apparent that the 

2 



) 0210382A2JA> 



WO 02/010382 PCT/EP01/08309 

modulation of Trp8 and/or TrplO, e.g. the inhibition of expression or activity , is of 
therapeutic interest, e.g. for the prevention of tumor progression. 



The present invention, thus, provides a Trp8, Trp9 and TrplO protein, respectively, as well as 
nucleic acid molecule encoding the protein and, moreover, an antisense RNA, a ribozyme and 
an inhibitor, which allow to inhibit the expression or the activity of Trp8, Trp9 and/or TrplO. 

In one embodiment, the present invention provides a diagnostic method for detecting a 
prostate cancer or endometrial cancer (cancer of the uterus) associated with Trp8 or TrplO in 
a tissue of a subject, comprising contacting a sample containing Trp8 and/or TrplO encoding 
mRNA with a reagent which detects Trp8 and/or TrplO or the corresponding mRNA. 

In a further embodiment, the present invention provides a diagnostic method for detecting a 
melanoma, chorion carcinoma, cancer of the lung and of the prostate in a tissue of a subject, 
comprising contacting a sample with a reagent which detects TrplOa and/or TrplOb antisense 
transcripts or TrplOa and/or TrplOb related antisense transcripts. 

In another embodiment, the present invention provides a method of treating a prostate tumor, 
carcinoma of the lung, carcinoma of the placenta (chorion carcinoma) or melanoma associated 
with Trp8 and/or TrplO, comprising administering to a subject with such an disorder a 
therapeutically effect amount of a reagent which modulates, e.g. inhibits, expression of Trp8 
and/or TrplO or the activity of the protein, e.g. the above described compounds. 

Finally, the present invention provides a method of gene therapy comprising introducing into 
cells of a subject an expression vector comprising a nucleotide sequence encoding the above 
mentioned antisense RNA or ribozyme, in operable linkage with a promoter. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1: A, phylogenetic relationship of trp and related proteins. B, hydropathy plot of the 
Trp8 protein sequence according to Kyte and Doolittle. C, alignment of Trp8a^ to the 
epithelial calcium channels ECaC (from rabbit) and Vrl (from rat). Putative transmembrane 
domains are underlined. 
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Figure 2: A, polymorphism of the Trp8 gene. The polymorphic variants Trp8a and Trp8b 
differ in five base pairs resulting in three amino acid exchanges in the derived protein 
sequences. Specific primers were derived from the Trp8 gene as indicated by arrows. B, the 
Trp8a and Trp8b genes are distinguishable by a single restriction site. Genomic fragments of 
the Trp8 gene can be amplified using specific primers (shown in A). The genomic fragment of 
the Trp8b gene contains an additional site of the restriction enzyme BSP 12861 (B). C, the 
Trp8 gene is located on chromosome 7. D, genotyping of eleven human subjects. A 458 bp 
genomic fragment of the Trp8 gene was amplified using specific primers (shown in A) and 
restricted with BSP 12861. The resulting fragments were analyzed by PAGE electrophoresis. 

Figure 3: The Trp8b protein is a calcium selective ion channel. A, representative trace of a 
pdiTrp8b transfected HEK 293 cell. Trp8b mediated currents are activated by voltage ramps 
(-100 mV - +100 mV) of 100 msec at -40 mV or +70 mV holding potential. 1, Trp8b currents 
in the presence at 2mm [Ca 2+ ] 0 ;2, effect of solution switch alone 3, switch to nominal zero 
calcium solution. B, Trp8b currents in the presence of zero divalent cations. C, current voltage 
relationship of the currents shown in A. Inset, leak subtracted current. D, current voltage 
relationship of the current shown in B. E, statistics of representative experiments. Black: Trp8 
transfected cells, gray: control cells. Columns from left to right: Trp8 currents at - 40 mV 
(n=12) and +70mV holding potential (n=12). Trp8 currents in standard bath solution 
including 120 mM NMDG without sodium (n = 7) and with nominal zero calcium ions (n = 8) 
or in the presence of ImM EGTA with zero divalent cations (n = 6). F, representative changes 
in [Ca 2+ ]i in Trp8b transfected HEK cells (gray) and controls (black) in the presence or 
absence of ImM [Ca 2+ ] 0 . Inset, relative increase of cytosolic calcium concentration of Trp8b 
transfected HEK cells, before and after readdition of 1 mM [Ca 2+ ] 0 in comparison to control 
cells. 

Figure 4 : The C-terminal region of the Trp8 protein binds calmodulin. A, N- and C-terminal 
fragments of the TrpS protein used for calmodulin binding studies. B, the Trp8 protein and a 
truncated TrpS protein which was in vitro translated after Muni cut of the cDNA, which lacks 
the C-terminal 32 amino acid residues, were in vitro translated in the presence of 35 S- 
methionine and incubated with calmodulin coupled agarose beads in the presence of 1 mM 
Ca 2+ or 2mM EGTA. C, calmodulin binding to N- and C-terminal fragments of the 
Trp8protein in the presence of Ca 2+ (1 mM) or EGTA (2 mM) 
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Figure 5: Expression pattern of the Trp8 cDNA. A, Northern blots (left panels, Clontech, Palo 
Alto) were hybridized using a 348 bp NcoI/BamHT fragment of the Trp9 cDNA. The probe 
hybridizes to mRNA species isolated from the commercial blot, but not to mRNA species 
isolated from benign prostate hyperplasia (right panel, mRNA isolated from 20 human 
subjects with benign prostate hyperplasia). B,C, in situ hybridization with biotinylated Trp8 
specific oligonucleotides on slides of human tissues. Left column antisense probes, right 
column sense probes. D, antinsense probes. 

Figure 6: Differential expression of TrpS cDNA in human prostate. A-F, in situ 
hybridization with prostatic tissues. A, normal prostate, B, primary carcinoma, C, benign 
hyperplasia, D, rezidive carcinoma, E, prostatic intraepithelial neoplasia, F, lymphnode 
metastasis of the prostata. 

Figure 7: Trp8a cDNA sequence and derived amino acid sequence 

Figure 8: A, Trp8b cDNA sequence and derived amino acid sequence 

B, cDNA sequence of splice variant 1 (12B1) 

C, cDNA sequence of splice variant 2 (17-3) 

D, cDNA sequence of splice variant 3 (23A3) 

E, cDNA sequence of splice variant 4 (23 C3) 

Figure 9: A, Trp9 cDNA sequence and derived amino acid sequence B, cDNA sequence of 
splice variant 15 and derived amino acid sequence. 

Figure 10: A, cDNA sequence of Trp 10a and derived amino acid sequence, B, cDNA 
fragment of Trp 10a and derived amino acid sequence. 

Figure 11: cDNA sequence of Trp 10b and derived amino acid sequence. 
Figure 12: Expression of TrpS mRNA in human endometrial cancer or cancer of the uterus. A 
- D, in situ hybridization with slides of endometrial cancer hybridized with TrpS antisense 
(left column) or sense probes as controls (right column). E - F, Trp8 antisense probes 
hybridized to slides of normal endometrium. It can be clearly seen no hybridization occurs 
with normal endometrial tissue. 
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Figure 13: Expression of human Trp9 and Tip 10 genes 

Northern blots were hybridized using Trp9 (upper panel) or TrplO (lower panel) specific 
probes. Expression of the Trp9 cDNA is detectable in many tissues including human prostate 
and colon as well as in benign prostatic hyperplasia. Expression of TrplO cDNA is detectable 
in human prostate of a commercial northern blot (Clontech, right side). This Northern blot 
contains prostatic tissue collected from 1 5 human subjects in the range of 14 - 60 years of age. 
No expression of TrplO cDNA was detectable in benign prostatic hyperplasia (left side). 

Figure 14 ; Expression of TrplO transcripts and TrplO-antisense transcripts in human prostate 
cancer and metastasis of a melanoma. In situ hybridizations of slides hybridized with TrplO- 
antisense (A-E, K-N) and TrplO related sense probes (F-J, P-R). It can clearly be seen that 
both probes detect the same cancer cells indicating that these cancer cells express TrplO 
transcripts as well as TrplO-antisense transcripts. S, no TrplO expression is detectable in 
benign hyperplasia of the prostate (BPH). O and T, show expression of TrplO transcripts (O) 
and TrplO-antisense transcripts (T) in a metastasis of a melanoma in human lung. Melanoma 
cancer cells express both TrplO transcripts and TrplO-antisense transcripts. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to an isolated nucleic acid molecule encoding the human 
prostate carcinoma associated protein Trp8a, TrpSb, Trp9, TrplOa or TrplOb or a protein 
exhibiting biological properties of Trp8a, Trp8b, Trp9, TrplOa or TrplOb and being selected 
from the group consisting of 

(a) a nucleic acid molecule encoding a protein that comprises the amino acid sequence 
depicted in Figure 7, 8 A, 9,10 or 11; 

(b) a nucleic acid molecule comprising the nucleotide sequence depicted in Figure 7, 8A, 
9,10, or 11; 

(c) a nucleic acid molecule included in DSMZ Deposit no. DSM 13579 (deposit date: 28 
June 2000), DSM 13580 (deposit date: 28 June 2000), DSM 13584 (deposit date: 5 
July 2000), DSM 13581 (deposit date: 28 June 2000) or DSM ....(deposit date:....); 

(d) a nucleic acid molecule with hybridizes to a nucleic acid molecule specified in (a) to 
(c) 
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(e) a nucleic acid molecule the nucleic acid sequence of which deviates from the nucleic 
sequences specified in (a) to (d) due to the degeneration of the genetic code; and 

(f) a nucleic acid molecule, which represents a fragment, derivative or allelic variation of 
a nucleic acid sequence specified in (a) to (e). 

As used herein, a protein exhibiting biological properties of Trp8a, Trp8b, Trp9,Trpl0a or 
Tip 10b is understood to be a protein having at least one of the activities as illustrated in the 
Examples, below. 

As used herein, the term „isolated nucleic acid molecule,, includes nucleic acid molecules 
substantially free of other nucleic acids, proteins, lipids, carbohydrates or other materials with 
which it is naturally associated. 

In a first embodiment, the invention provides an isolated nucleic acid molecule encoding the 
human prostate carcinoma associated protein TrpSa, TrpSb, Trp9,Trpl0a or TrplOb 
comprising the amino acid sequence depicted in Figure 7, 8 A, 9,10 or 11. The present 
invention also provides a nucleic acid molecule comprising the nucleotide sequence depicted 
in Figure 7, 8A, 9,10 or 11. 



The present invention provides not only the generated nucleotide sequence identified in 
Figure 7, 8 A, 9,10 or 11, respectively and the predicted translated amino acid sequence, 
respectively, but also plasmid DNA containing a Trp8a cDNA deposited with the DSMZ, 
under DSM 13579, a Trp8b cDNA deposited with the DSMZ, under DSM 13580, a Trp9 
cDNA deposited with the DSMZ, under DSM 13584, a TrplOa cDNA deposited with the 
DSMZ, under DSM 13581, and a TrplOb cDNA deposited with the DSMZ, under DSM...., 
respectively. The nucleotide sequence of each deposited Trp-clone can readily be determined 
by sequencing the deposited clone in accordance with known methods. The predicted amino 
acid sequence can then be verified from such deposits. Moreover, the amino acid sequence of 
the protein encoded by each deposited clone can also be directly determined by peptide 
sequencing or by expressing the protein in a suitable host cell containing the deposited Tip- 
encoding DNA, collecting the protein, and determining its sequence. 

The nucleic acid molecules of the invention can be both DNA and RNA molecules. Suitable 
DNA molecules are, for example, genomic or cDNA molecules. It is understood that all 
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nucleic acid molecules encoding all or a portion of Trp8a, TrpSb, Trp9,Trpl0a or TrplOb are 
also included, as long as they encode a polypeptide with biological activity. The nucleic acid 
molecules of the invention an be isolated from natural sources or can be synthesized 
according to know methods. 

The present invention also provides nucleic acid molecules which hybridize to the above 
nucleic acid molecules. As used herein, the term „hybridize„ has the meaning of hybridization 
under conventional hybridization conditions, preferably under stringent conditions as 
described, for example, in Sambrook et al., Molecular Cloning, A Laboratory Manual 2 nd 
edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. Also 
contemplated are nucleic acid molecules that hybridize to the Trp nucleic acid molecules at 
lower stringency hybridization conditions. Changes in the stringency of hybridization and 
signal detection are primarily accomplished through the manipulation of formamide 
concentration (lower percentages of formamide result in lowered stringency), salt conditions, 
or temperature. For example, lower stringency conditions include an overnight incubation at 
37°Cin a solution comprising 6X SSPE (20X SSPE = 3M NaCl; 9.2M NaH 2 P0 4 ; 0.02M 
EDTA, pH7.4), 0.5% SDS, 30% formamide, 100 ng/ml salmon sperm blocking DNA, 
following by washes at 50°C with 1 X SSPE, 0.1% SDS. In addition, to achieve even lower 
stringency, washes performed following stringent hybridization can be done at higher salt 
concentrations (e.g. 5X SSC). Variations in the above conditions may be accomplished 
through the inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. The inclusion of specific blocking reagents may 
require modification of the hybridization conditions described above, due to problems with 
compatibility. 

Nucleic acid molecules that hybridize to the molecules of the invention can be isolated, e.g., 
from genomic or cDNA libraries that were produced from human cell lines or tissues. In order 
to identify and isolate such nucleic acid molecules the molecules of the invention or parts of 
these molecules or the reverse complements of these molecules can be used, for example by 
means of hybridization according to conventional methods (see, e.g., Sambrook et al., supra). 
As a hybridization probe nucleic acid molecules can be used, for example, that have exactly 
or basically the nucleotide sequence depicted in Figure 7, SA, 9,10 or 11, respectively, or 
parts of these sequences. The fragments used as hybridization probe can be synthetic 
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fragments that were produced by means of conventional synthetic methods and the sequence 
of which basically corresponds to the sequence of a nucleic acid molecule of the invention. 



The nucleic acid molecules of the present invention also include molecules with sequences 
that are degenerate as a result of the genetic code. 

In a further embodiment, the present invention provides nucleic acid molecules which 
comprise fragments, derivatives and allelic variants of the nucleic acid molecules described 
above encoding a protein of the invention. „Fragments„ are understood to be parts of the 
nucleic acid molecules that are long enough to encode one of the described proteins. These 
fragments comprise nucleic acid molecules specifically hybridizing to transcripts of the 
nucleic acid molecules of the invention. These nucleic acid molecules can be used, for 
example, as probes or primers in the diagnostic assay and/or kit described below and, 
preferably, are oligonucleotides having a length of at least 10, in particular of at least 15 and 
particularly preferred of at least 50 nucleotides. The nucleic acid molecules and 
oligonucleotides of the invention can also be used, for example, as primers for a PCR 
reaction. Examples of particular useful probes (primers) are shown in Tables 1 and 2. 

Table 1 

TrpS probes used for in situ hybridization: 
Probes (antisense) 

1. ) 5' TCCGCTGCCGGTTGAGATCTTGCC 3' 

2. ) 5' CTTGCTCCATAGGCAGAGAATTAG 3' 

3. ) 5' ATCCTC AGAGCCCCGGGTGTGGAA3 9 

Controls (sense) 

1. ) 5' GGCAAGATCTCAACCGGCAGCGGA 3' 

2. ) 5' CTAATTCTCTGCCTATGGAGCAAG 3' 

3. ) 5' TTCCAC ACCCGGGGCTCTGAGGAT 3 5 

Tabelle 2 

TrplO probes used for the in situ hybridizations shown in Figure 14: 
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Probes (antisense) 

1. ) 5' GCTTCCACCCCAAGCTTCACAGGAATAGA 3' (Figure 14 A, 14B) 

2. ) 5' GGCGATGAAATGCTGGTCTGTGGC 3' (Figure 14C, 14D, 14N, 14S, 140) 

3. ) 5' ATCTTCCAGTTCTTGGTGTCTCGG 3' (Figure 14E, 14K) 

4. ) 5' GCTGCAGTACTCCTGCACCAGGAA 3' (Figure 14L, 14M) 

Probes (sense) 

L) 5' TCTATTCCTGTGAAGCTTGGGGTGGAAGC 3 5 (Figure 14F, 14G) 

2. ) 5' GCCACAGACCAGCATTTCATCGCC 3' (Figure 14H, 141, 14T) 

3. ) 5' CCGAGACACCAAGAACTGGAAGAT 3' (Figure 14J, 14P) 

4. ) 5' TTCCTGGTGCAGGAGTACTGCAGC V (Figure 14Q, 14R) 

The term „derivative„ in this context means that the sequences of these molecules differ from 
the sequences of the nucleic acid molecules described above at one or several positions but 
have a high level of homology to these sequences. Homology hereby means a sequence 
identity of at least 40%, in particular an identity of at least 60%, preferably of more than 80% 
and particularly preferred of more than 90%. These proteins encoded by the nucleic acid 
molecules have a sequence identity to the amino acid sequence depicted in Figure 7, 8 A, 9, 10 
and 11, respectively, of at least 80%, preferably of 85% and particularly preferred of more 
than 90%, 97% and 99%. The deviations to the above-described nucleic acid molecules may 
have been produced by deletion, substitution, insertion or recombination. The definition of the 
derivatives also includes splice variants, e.g. the splice variants shown in Figures 8B to 8E 
and9B. 

The nucleic acid molecules that are homologous to the above-described molecules and that 
represent derivatives of these molecules usually are variations of these molecules that 
represent modifications having the same biological function. They can be naturally occurring 
variations, for example sequences from other organisms, or mutations that can either occur 
naturally or that have been introduced by specific mutagenesis. Furthermore the variations can 
be synthetically produced sequences. The allelic variants can be either naturally occurring 
variants or synthetically produced variants or variants produced by recombinant DNA 
processes. 
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Generally, by means of conventional molecular biological processes it is possible (see, e.g., 
Sambrook et al, supra) to introduce different mutations into the nucleic acid molecules of the 
invention. As a result Trp proteins or Trp related proteins with possibly modified biological 
properties are synthesized. One possibility is the production of deletion mutants in which 
nucleic acid molecules are produced by continuous deletions from the 5 9 - or 3 5 -terminal of the 
coding DNA sequence and that lead to the synthesis of proteins that are shortened 
accordingly. Another possibility is the introduction of single-point mutation at positions 
where a modification of the amino aid sequence influences, e.g., the ion channel properties or 
the regulations of the trp-ion channel. By this method muteins can be produced, for example, 
that possess a modified ion conducting pore, a modified K m -value or that are no longer subject 
to the regulation mechanisms that normally exist in the cell, e.g. with regard to allosteric 
regulation or covalent modification. Such muteins might also be valuable as therapeutically 
useful antagonists of Trp8a, Trp8b, Trp9,Trpl0a or TrplOb, respectively. 

For the manipulation in prokaryotic cells by means of genetic engineering the nucleic acid 
molecules of the invention or parts of these molecules can be introduced into plasmids 
allowing a mutagenesis or a modification of a sequence by recombination of DNA sequences. 
By means of conventional methods (cf. Sambrook et al., supra) bases can be exchanged and 
natural or synthetic sequences can be added. In order to link the DNA fragments with each 
other adapters or linkers can be added to the fragments. Furthermore, manipulations can be 
performed that provide suitable cleavage sites or that remove superfluous DNA or cleavage 
sites. If insertions, deletions or substitutions are possible, in vitro mutagenesis, primer repair, 
restriction or ligation can be performed. As analysis method usually sequence analysis, 
restriction analysis and other biochemical or molecular biological methods are used. 

The proteins encoded by the various variants of the nucleic acid molecules of the invention 
show certain common characteristics, such as ion channel activity, molecular weight, 
immunological reactivity or conformation or physical properties like the electrophoretical 
mobilty, chromatographic behavior, sedimentation coefficients, solubility, spectroscopic 
properties, stability; pH optimum, temperature optimum. 

The invention furthermore relates to vectors containing the nucleic acid molecules of the 
invention. Preferably, they are plasmids, cosmids, viruses, bacteriophages and other vectors 
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usually used in the field of genetic engineering. Vectors suitable for use in the present 
invention include, but are not limited to the T7-based expression vector for expression in 
mammalian cells and baculovirus-derived vectors for expression in insect cells. Preferably, 
the nucleic acid molecule of the invention is operatively linked to the regulatory elements in 
the recombinant vector of the invention that guarantee the transcription and synthesis of an 
RNA in prokryotic and/or eukaryotic cells that can be translated. The nucleotide sequence to 
be transcribed can be operably linked to a promoter like a T7, metallothionein I or polyhedrin 
promoter. 

In a further embodiment, the present invention relates to recombinant host cells transiently or 
stable containing the nucleic acid molecules or vectors or the invention. A host cell is 
understood to be an organism that is capable to take up in vitro recombinant DNA and, if the 
case may be, to synthesize the proteins encoded by the nucleic acid molecules of the 
invention. Preferably, these cells are prokaryotic or eukaryotic cells, for example mammalian 
cells, bacterial cells, insect cells or yeast cells. The host cells of the invention are preferably 
characterized by the fact that the introduced nucleic acid molecule of the invention either is 
heterologous with regard to the transformed cell, i.e. that it does not naturally occur in these 
cells, or is localized at a place in the genome different from that of the corresponding 
naturally occurring sequence. 

A further embodiment of the invention relates to isolated proteins exhibiting biological 
properties of the human prostate carcinoma associated protein Trp8a, Trp8b, Trp9,Trpl0a or 
TrplOb and being encoded by the nucleic acid molecules of the invention, as well as to 
methods for their production, whereby, e.g., a host cell of the invention is cultivated under 
conditions allowing the synthesis of the protein and the protein is subsequently isolated from 
the cultivated cells and/or the culture medium. Isolation and purification of the recombinantly 
produced proteins may be carried out by conventional means including preparative 
chromatography and affinity and immunological separations involving affinity with an anti- 
Trp8a-, anti-Trp8b-, anti-Trp9-,anti-Trpl0a- or anti-Trpl Ob-antibody, respectively. 

As used herein, the term „isolated protein,, includes proteins substantially free of other 
proteins, nucleic acids, lipids, carbohydrates or other materials with which it is naturally 
associated. Such proteins however not only comprise recombinantly produced proteins but 
include isolated naturally occurring proteins, synthetically produced proteins, or proteins 
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produced by a combination of these methods. Means for preparing such proteins are well 
understood in the art. The Trp proteins are preferably in a substantially purified form. A 
recombinantly produced version of a human prostate carcinoma associated protein Trp8a, 
TrpSb, Trp9,Trpl0a or TrplOb protein, including the secreted protein, can be substantially 
purified by the one-step method described in Smith and Johnson, Gene 67; 31-40 (1988). 

In a further preferred embodiment, the present invention relates to an antisense RNA 
sequence characterised that it is complementary to an mRNA transcribed from a nucleic acid 
molecule of the present invention or a part thereof and can selectively bind to said mRNA, 
said sequence being capable of inhibiting the synthesis of the protein encoded by said nucleic 
acid molecules, and a ribozyme characterised in that it is complementary to an mRNA 
transcribed from a nucleic acid molecule of the present invention or a part thereof and can 
selectively bind to and cleave said mRNA, thus inhibiting the synthesis of the proteins 
encoded by said nucleic acid molecules. Riboyzmes which are composed of a single RNA 
chain are RNA enzymes, i.e. catalytic RNAs, which can intermolecularly cleave a target 
RNA, for example the mRNA transcribed from one of the Trp genes. It is now possible to 
construct ribozymes which are able to cleave the target RNA at a specific site by following 
the strategies described in the literature, (see, e.g., Tanner et al., in: Antisense Research and 
Applications, CRC Press Inc. (1993), 415-426). The two main requirements for such 
ribozymes are the catalytic domain and regions which are complementary to the target RNA 
and which allow them to bind to its substrate, which is a prerequisite for cleavage. Said 
complementary sequences, i.e., the antisense RNA or ribozyme, are useful for repression of 
Trp8a-, TrpSb, Trp9-,Trpl0a- and Trp 1 Ob-expression, respectively, i.e. in the case of the 
treatment of a prostate cancer or endometrial cancer (carcinoma of the uterus). Preferably, the 
antisense RNA and ribozyme of the invention are complementary to the coding region. The 
person skilled in the art provided with the sequences of the nucleic acid molecules of the 
present invention will be in a position to produce and utilise the above described antisense 
RNAs or ribozymes. The region of the antisense RNA and ribozyme, respectively, which 
shows complementarity to the mRNA transcribed from the nucleic acid molecules of the 
present invention preferably has a length of at least 10, in particular of at least 15 and 
particularly preferred of at least 50 nucleotides. 

In still a further embodiment, the present invention relates to inhibitors of Trp8a, Trp8b, Trp9, 
Trp 10a and TrplOb, respectively, which fulfill a similar purpose as the antisense RNAs or 
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ribozymes mentioned above, i.e. reduction or elimination of biologically active TrpSa, Trp8b, 
Trp9, TrplOa or TrplOb molecules. Such inhibitors can be, for instance, structural analogues 
of the corresponding protein that act as antagonists. In addition, such inhibitors comprise 
molecules identified by the use of the recombinantly produced proteins, e.g. the 
recombinantly produces protein can be used to screen for and identify inhibitors, for example, 
by exploiting the capability of potential inhibitors to bind to the protein under appropriate 
conditions. The inhibitors can, for example, be identified by preparing a test mixture wherein 
the inhibitor candidate is incubated with Trp8a, Trp8b, Trp9, TrplOa or TrplOb, respectively, 
under appropriate conditions that allow Trp8a, Trp8b, Trp9, TrplOa or TrplOb to be in a 
native conformation. Such an in vitro test system can be established according to methods 
well known in the art. Inhibitors can be identified, for example, by first screening for either 
synthetic or naturally occurring molecules that bind to the recombinantly produced Trp 
protein and then, in a second step, by testing those selected molecules in cellular assays for 
inhibition of the Trp protein, as reflected by inhibition of at least one of the biological 
activities as described in the examples, below. Such screening for molecules that bind Trp8a, 
TrpSb, Trp9, TrplOa or TrplOb could easily performed on a large scale, e.g. by screening 
candidate molecules from libraries of synthetic and/or natural molecules. Such an inhibitor is, 
e.g., a synthetic organic chemical, a natural fermentation product, a substance extracted from 
a microorganism, plant or animal, or a peptide. Additional examples of inhibitors are specific 
antibodies, preferably monoclonal antibodies. Moreover, the nucleic sequences of the 
invention and the encoded proteins can be used to identify further factors involved in tumor 
development and progression. In this context it should be emphasized that the modulation of 
the calcium channel of a member of the trp family can result in the stimulation of the immune 
response of T lymphocytes leading to proliferation of the T lymphocytes. The proteins of the 
invention can, e.g., be used to identify further (unrelated) proteins which are associated with 
the tumor using screening methods based on protein/protein interactions, e.g. the two-hybrid- 
system Fields, S. and Song, O. (1989) Nature (340): 245-246. 

The present invention also provides a method for diagnosing a prostate carcinoma which 
comprises contacting a target sample suspected to contain the protein TrpSa, TrpSb, TrplOa 
and/or TrplOb or the Trp8a, Trp8b, TrplOa and/or TrplOb encoding mRNA with a reagent 
which reacts with Trp8a, TrpSb, TrplOa and/or TrplOb or the Trp8a, Trp8b, TrplOa and/or 
TrplOb encoding mRNA and detecting Trp8a, Trp8b, TrplOa and/or TrplOb or Trp8a, TrpSb, 
TrplOa and/or Trp 10b encoding mRNA. 

14 



0210382A2_IA> 



WO 02/010382 



PCT/EPO 1/08309 



It has been found that carcinoma cells of placenta (chorion carcinoma), lung and prostate 
express TrplO transcripts as well as TrplO antisense transcripts and transcripts being in part 
complementary to TrplO antisense transcripts. Accordingly, the present invention also 
provides a method for diagnosing a melanoma, chorion carcinoma, cancer of the lung and of 
the prostate in a tissue of a subject, comprising contacting a sample with a reagent which 
detects TrplOa and/or TrplOb antisense RNA. 

When the target is mRNA (or antisense RNA), the reagent is typically a nucleic acid probe or 
a primer for PCR. The person skilled in the art is in a position to design suitable nucleic acids 
probes based on the information as regards the nucleotide sequence of Trp8a, Trp8b, TrplOa 
or TrplOb as depicted in figure 7, Sa, 10 and 11, respectively, or tables 1 and 2, above. When 
the target is the protein, the reagent is typically an antibody probe. The term „antibody", 
preferably, relates to antibodies which consist essentially of pooled monoclonal antibodies 
with different epitopic specifities, as well as distinct monoclonal antibody preparations. 
Monoclonal antibodies are made from an antigen containing fragments of the proteins of the 
invention by methods well known to those skilled in the art (see, e.g., Kohler et al., Nature 
256 (1975), 495). As used herein, the term „antibody" (Ab) or „monoclonal antibody" (Mab) 
is meant to include intact molecules as well as antibody fragments (such as, for example, Fab 
and F(ab') 2 fragments) which are capable of specifically binding to protein. Fab and f(ab')2 
fragments lack the Fc fragment of intact antibody, clear more rapidly from the circulation, and 
may have less non-specific tissue binding than an intact antibody. (Wahl et al., J. Nucl. Med. 
24: 316-325 (1983)). Thus, these fragments are preferred, as well as the products of a FAB or 
other immunoglobulin expression library. Moreover, antibodies of the present invention 
include chimerical, single chain, and humanized antibodies. The target cellular component, 
i.e. Trp8a, TrpSb, TrplOa and/or TrplOb or Trp8a, TrpSb, TrplOa and/or TrplOb encoding 
mRNA or TrplOa/b antisense transcripts, e.g., in biological fluids or tissues, may be detected 
directly in situ, e.g. by in situ hybridization (e.g., according to the examples, below) or it may 
be isolated from other cell components by common methods known to those skilled in the art 
before contacting with a probe. Detection methods include Northern blot analysis, RNase 
protection, in situ methods, e.g. in situ hybridization, in vitro amplification methods (PCR, 
LCR, QRNA replicase or RNA-transcription/amplification (TAS, 3SR), reverse dot blot 
disclosed in EP-B1 O 237 362)), immunoassays, Western blot and other detection assays that 
are known to those skilled in the art. 
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Products obtained by in vitro amplification can be detected according to established methods, 
e.g. by separating the products on agarose gels and by subsequent staining with ethidium 
bromide. Alternatively, the amplified products can be detected by using labeled primers for 
amplification or labeled dNTPs. 

The probes can be detectable labeled, for example, with a radioisotope, a bioluminescent, 
compound, a chemiluminescent compound, a fluorescent compound, a metal chelate, or an 
enzyme. 

Expression of Trp8a, Trp8b, TrplOa and TrplOb, respectively, in tissues can be studied with 
classical immunohistological methods (Jalkanen et al., J. Cell. Biol. 101 (1985), 976-985; 
Jalkanen et al., J. Cell. Biol. 105 (1987), 3087-3096; Sobol et al. Clin. Immunpathol. 24 
(1982), 139-144; Sobol et al., Cancer 65 (1985), 2005-2010). Other antibody based methods 
useful for detecting protein gene expression include immunoassays, such as the enzyme 
linked immunosorbent assay (ELISA) and the radioimmunoassay (RIA). Suitable antibody 
assay labels are known in the art and include enzyme labels, such as, glucose oxidase, and 
radioisotopes, such as iodine ( 125 I, m I), carbon ( 14 C), sulfur ( 35 S), tritium ( 3 H), indium ( m In), 
and technetium rhodamine, and biotin. In addition to assaying Trp8a, Trp8b, Trp 10a or 
TrplOb levels in a biological sample, the protein can also be detected in vivo by imaging. 
Antibody labels or markers for in vivo imaging of protein include those detectable by X- 
radiography, NMR or ESR. For X-radiography, suitable labels include radioisotopes such as 
barium or cesium, which emit detectable radiation but are not overtly harmful to the subject. 
Suitable markers for NMR and ESR include those with a detectable characteristic spin, such 
as deuterium, which may be incorporated into the antibody by labeling of nutrients for the 
relevant hybridoma. A protein-specific antibody or antibody fragment which has been labeled 
with an appropriate detectable imaging moiety, such as a radioisotope (for example, 131 I, 112 In, 
99 mTc), a radio-opaque substance, or a material detectable by nuclear magnetic resonance, is 
introduced (for example, parenterally, subcutaneously, or intraperitoneally) into the mammal. 
It will be understood in the art that the size of the subject and the imaging system used will 
determine the quantity of imaging moiety needed to produce diagnostic images. In the case of 
a radioisotope moiety, for a human subject, the quantity of radioactivity injected will normally 
range from about 5 to 20 millicuries of "mTc. The labeled antibody or antibody fragment will 
then preferentially accumulate at he location of cells which contain the specific protein. In 
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vivo tumor imaging is described in S.W. Burchiel et al., „Immunopharmacokinetics of 
Radiolabeled Antibodies and Their Fragments". (Chapter 13 in Tumor Imaging: The 
Radiochemical Detection of Cancer, S.W. Burchiel and B.A. Rhodes, eds., Masson 
Publishing Inc. (1982)). 

The marker Trp8a and TrpSb is also useful for prognosis, for monitoring the progression of 
the tumor and the diagnostic evaluation of the degree of malignancy of a prostate tumor 
(grading and staging), e.g. by using in situ hybridization: In a primary carcinoma Trp8 is 
expressed in about 2 to 10% of carcinoma cells, in a rezidive carcinoma in about 10 to 60% of 
cells and in metastases in about 60 to 90% of cells. 

The present invention also relates to a method for diagnosing endometrial cancer (cancer of 
the uterus) which comprises contacting a target sample suspected to contain the protein Trp8a 
and/or Trp8b or the TrpSa and/or TrpSb encoding mRNA with a reagent which reacts with 
Trp8a and/or Trp8b or the encoding mRNA and detecting TrpSa and/or TrpSb encoding 
mRNA. As regards particular embodiments of this method reference is made to the particular 
embodiments of the method of diagnosing a prostate cancer outlined above. 

For evaluating whether the concentration of Trp8a, Trp8b, TrplOa or TrplOb or the 
concentration of Trp8a, Trp8b, TrplOa or TrplOb encoding mRNA is normal or increased, 
thus indicative for the presence of a malignant tumor, the measured concentration is compared 
with the concentration in a normal tissue, preferably by using the ratio of Trp8a:Trp9, 
Trp8b:Trp9 or Trpl0(a or b)/Trp9 for quantification. 

Since the prostate carcinoma forms its own basement membrane when growing invasively, it 
can be concluded that only cells expressing Trp8 and TrplO are involved in this phenomenon. 
Thus, it can be concluded that by inhibiting the expression and/or activity of these proteins an 
effective therapy of cancers like PCA is provided. 

Thus, the present invention also relates to a pharmaceutical composition containing a reagent 
which decreases or inhibits TrpSa, TrpSb, TrplOa and/or TrplOb expression or the activity of 
Trp8a, TrpSb, TrplOa and/or TrplOb, and a method for preventing, treating, or ameliorating a 
prostate tumor, endometrial cancer (uterine carcinoma) tumor, a chorion carcinoma, cancer of 
the lung or melanoma, which comprises administering to a mammalian subject a 
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therapeutically effective amount of a reagent which decreases or inhibits Trp8a, Trp8b, 
TrplOa and/or TrplOb expression or the activity of Trp8a, TrpSb, TrplOa and/or TrplOb. 
Examples of such reagents are the above described . antisense RNAs, ribozymes or inhibitors, 
e.g. specific antibodies. Furthermore, peptides, which inhibit or modulate the biological 
function of TrpSa, Trp8b, Trp9, TrplOa and/or TrplOb may be useful as therapeutical 
reagents. For example, these peptides can be obtained by screening combina torial phage 
display libraries (Cosmix, Braunschweig, Germany) as described by Rottgen, P. and Collins, 
J. (Gene (1995) 164 (2): 243-250). Furthermore, antigenic epitopes of the Trp8 and TrplO 
proteins can be identified by the expression of recombinant Trp8 and TrplO epitope libraries 
in E. coli (Marquart, A. & Flockerzi, V., FEBS Lett. 407 (1997), 137-140; Trost, C, et al., 
FEBS Lett. 451 (1999) 257-263 and the consecutive screening of these libraries with serum of 
patients with cancer of the prostate or of the endometrium. Those Trp8 and TrplO epitopes 
which are immunogenic and which lead to the formation of antibodies in the serum of the 
patients can be then be used as Trp8 or TrplO derived peptide vaccines for immune inventions 
against cancer cells which express Trp8 or TrplO. Alternatively to the E. coli expression 
system, Trp8 or TrplO or epitopes of Trp8 and TrplO can be expressed in mammalian cell 
lines such as human embryonic kidney (Hek 293) cells (American Type Culture Collection, 
ATCC CRL 1573). 

Finally, compounds useful for therapy of the above described diseases comprise compounds 
which act as antagonists or agonists on the ion channels Trp8, Trp9 and TrplO. It could be 
shown that Trp8 is a highly calcium selective ion channel which in the presence of 
monovalent (namely sodium) and divalent ions (namely calcium) is only permeable for 
calcium ions (see Example 4, below, and Figures 3 A, C, E). Under physiological conditions, 
Trp8 is a calcium selective channel exhibiting large inward currents. This very large 
conductance of Trp8 channels (as wells as Trp9 and TrplOa/b channels) is useful to establish 
systems for screening pharmacological compounds interacting with Trp-channels including 
high throughput screening systems. Useful high throughput screening systems are well known 
to the person skilled in the art and include, e.g., the use of cell lines stably or transiently 
transfected with DNA sequences encoding Trp8, Trp9 and TrplO channels in assays to detect 
calcium signaling in biological systems. Such systems include assays based on Ca-sensitive 
dyes such as aequorin, apoaequorin, Fura-2, Fluo-3 and Indo-1. 
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Accordingly, the present invention also relates to a method for identifying compounds which 
act as agonists or antagonists on the ion channels Trp8, Trp9 and/or Trp 10, said method 
comprising contacting a test compound with the ion channel Trp8, Trp9 and/or Trp 10, 
preferably by using a system based on cells stably or transiently transfected with DNA 
sequences encoding Trp8, Trp9 and/or TrplO, and determining whether said test compound 
affects the calcium uptake. 

For administration the above described reagents are preferably combined with suitable 
pharmaceutical carriers. Examples of suitable pharmaceutical carriers are well known in the 
art and include phosphate buffered saline solutions, water, emulsions, such as oil/water 
emulsions, various types of wetting agents, sterile solutions etc. Such carriers can be 
formulated by conventional methods and can be administered to the subject at a suitable dose. 
Administration of the suitable compositions may be effected by different ways, e.g. by 
intravenous, intraperetoneal, subcutaneous, intramuscular, topical or intradermal 
administration. The route of administration, of course, depends on the nature of the tumor and 
the kind of compound contained in the pharmaceutical composition. The dosage regimen will 
be determined by the attending physician and other clinical factors. As is well known in the 
medical arts, dosages for any one patient depends on many factors, including the patient's 
size, body surface area, age, sex, the particular compound to be administered, time and route 
of administration, the kind and stage of the tumor, general health and other drugs being 
administered concurrently. 

The delivery of the antisense RNAs or ribozymes of the invention can be achieved by direct 
application or, preferably, by using a recombinant expression vector such as a chimeric virus 
containing these compounds or a colloidal dispersion system. By delivering these nucleic 
acids to the desired target, the intracellular expression of Trp8a, Trp8b, TrplOa and/or Trp 1 0b 
and, thus, the level of TrpSa, TrpSb, TrplOa and/or Trp 10b can be decreased resulting in the 
inhibition of the negative effects of TrpSa, Trp8b, TrplOa and/or Trp 10b, e.g. as regards the 
metastasis formation of PC A. 

Direct application to the target site can be performed, e.g., by ballistic delivery, as a colloidal 
dispersion system or by catheter to a site in artery. The colloidal dispersion systems which can 
be used for delivery of the above nucleic, acids include macromolecule complexes, 
nanocapsules, microspheres, beads and lipid-based systems including oil-in-water emulsions 
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(mixed), micelles, liposomes and lipoplexes, The preferred colloidal system is a liposome. 
The composition of the liposome is usually a combination of phospholipids and steroids, 
especially cholesterol. The skilled person is in a position to select such liposomes which are 
suitable for the delivery of the desired nucleic acid molecule. Organ-specific or cell-specific 
liposomes can be used in order to achieve delivery only to the desired tumor. The targeting of 
liposomes can be carried out by the person skilled in the art by applying commonly known 
methods. This targeting includes passive targeting (utilizing the natural tendency of the 
liposomes to distribute to cells of the RES in organs which contain sinusoidal capillaries) or 
active targeting (for example by coupling the liposome to a specific ligand, e.g., an antibody, 
a receptor, sugar, glycolipid, protein etc., by well known methods). In the present invention 
monoclonal antibodies are preferably used to target liposomes to specific tumors via specific 
cell-surface ligands. 

Preferred recombinant vectors useful for gene therapy are viral vectors, e.g. adenovirus, 
herpes virus, vaccinia, or, more preferably, an RNA virus such as a Retrovirus. Even more 
preferably, the retroviral vector is a derivative of a murine or avian retrovirus. Examples of 
such retroviral vectors which can be used in the present invention are: Moloney murine 
leukemia virus (MoMuLV), Harvey murine sarcoma virus (HaMuSV), murine mammary 
tumor virus (MuMTV) and Rous sarcoma virus (RSV). Most preferably, a non-human 
primate retroviral vector is employed, such as the gibbon ape leukemia virus (GaLV), 
providing a broader host range compared to murine vectors. Since recombinant retroviruses 
are defective, assistance is required in order to produce infectious particles. Such assistance 
can be provided, e.g., by using helper cell lines that contain plasmids encoding all of the 
structural genes of the retrovirus under the control of regulatory sequences within the LTR. 
Suitable helper cell lines are well known to those skilled in the art. Said vectors can 
additionally contain a gene encoding a selectable marker so that the transduced cells can be 
identified. Moreover, the retroviral vectors can be modified in such a way that they become 
target specific. This can be achieved, e.g., by inserting a polynucleotide encoding a sugar, a 
glycolipid, or a protein, preferably an antibody. Those skilled in the art know additional 
methods for generating target specific vectors. Further suitable vectors and methods for in 
vitro- or in vivo-gene therapy are described in the literature and are known to the persons 
skilled in the art; see, e.g., WO 94/29469 or WO 97/00957. 
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In order to achieve expression only in the target organ, i.e. tumor to be treated, the nucleic 
acids encoding, e.g. an antisense KNA or ribozyme can also be operably linked to a tissue 
specific promoter and used for gene therapy. Such promoters are well known to those skilled 
in the art (see e.g. Zimmermann et al., (1994) Neuron 12, 11-24; Vidal et al.; (1990) EMBO J. 
9, 833-840; Mayford et al., (1995), Cell 81, 891-904; Pinkert et al., (1987) Genes & Dev. i, 
268-76). 

For use in the diagnostic research discussed above, kits are also provided by the present 
invention. Such kits are useful for the detection of a target cellular component, which is 
TrpSa, Trp8b, TrplOa and/or TrplOb or, alternatively, Trp8a, Trp8b, TrplOa and/or TrplOb 
encoding mRNA or TrplOa/b antisense transcripts, wherein the presence or an increased 
concentration of Trp8a, Trp8b, TrplOa and/or TrplOb or, alternatively, TrpSa, Trp8b, TrplOa 
and/or TrplOb encoding mRNA or TrplOa/b antisense transcripts is indicative for a prostate 
tumor, endometrial cancer, melanoma, chorion carcinoma or cancer of the lung, said kit 
comprising a probe for detection of Trp8a, TrpSb, Trp9, TrplOa and/or TrplOb or, 
alternatively, TrpSa, TrpSb, Trp9, TrplOa and/or TrplOb encoding mRNA or T^pl0a^ 
antisense transcripts. The probe can be detectably labeled. Such probe may be a specific 
antibody or specific oligonucleotide. In a preferred embodiment, said kit contains an anti- 
TrpSa-j anti-Trp8b-, anti-Trp9-, anti-TrplOa-and/or anti-Trpl Ob-antibody and allows said 
diagnosis, e.g., by ELISA and contains the antibody bound to a solid support, for example, a 
polystyrene microtiter dish or nitrocellulose paper, using techniques known in the art. 
Alternatively, said kits are based on a RIA and contain said antibody marked with a 
radioactive isotope. In a preferred embodiment of the kit of the invention the antibody is 
labeled with enzymes, fluorescent compounds, luminescent compounds, ferromagnetic probes 
or radioactive compounds. The kit of the invention may comprise one or more containers 
filled with, for example, one or more probes of the invention. Associated with container (s) of 
the kit can be a notice in the form prescribed by a governmental agency regulating the 
manufacture, use or sale of pharmaceuticals or biological products, which notice reflects 
approval by the agency of manufacture, us or sale for human administration. 



EXAMPLES 
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The following Examples are intended to illustrate, but not to limit the invention. While such 
Examples are typical of those that might be used, other methods known to those skilled in the 
art may alternatively be utilized. 

Example 1: Materials and Methods 

(A) Isolation of cDNA clones and Northern blot analysis 

Total RNA was isolated from human placenta an prostate using standard techniques. Isolation 
of mRNA was performed with poly (A) 4 KNA - spin columns (New England Biolabs, Beverly, 
USA) according to the instructions of the manufacturer. Poly (a) *RNA was reverse 
transcribed using the cDNA choice system (Gibco-BRL, Rockville, USA) and subcloned in A,- 
Zap phages (Stratagene, La Jolla, USA). An human expressed sequence tag (GenBank 
accession number 1404042) was used to screen an oligo d(T) primed human placenta cDNA 
library. Several cDNA clones were identified and isolated. Additional cDNA clones were 
isolated from two specifically primed cDNA libraries using primers 5'-gca tag gaa ggg aca 
ggt gg-3 ' and 5'-gag agt cga ggt cag tgg tcc-3\ 

cDNA clones were sequenced using a thermocycler (PE Applied Biosystems, USA) and 
Thermo Sequenase (Amersham Pharmacia Biotech Europe, Freiburg, Germany). DNA 
sequences were analyzed with an automated sequencer (Licor, Linccoln, USA). 

For Northern blot analysis 5 human poly (A) + RNA from human placenta or prostate were 
separated by electrophoresis on 0.8 % agarose gels. Poly (A) + RNA was transferred to 
Hybond N nylon membranes (Amersham Pharmacia Biotech Europe, Freiburg, Germany). 
The membranes were hyridized in the presence of 50 % formamide at 42°C over night. DNA 
probes were labelled using [a 32 P]dCTP and the „ready prime,, labelling kit (Amersham 
Pharmacia Biotech Europe, Freiburg, Germany). Commercial Northern blots were hybridized 
according to the distributors instructions (Clontech, Paolo Alto, USA). 

(B) Construction of expression plasmids and transfection of HEK 293 cells 

Lipofections were carried out with the recombinant dicistronic eucaryotic expression plasmid 
pdiTRPS containing the cDNA of TrpSb under the control of the chicken 13-actin promotor 
followed by an internal ribosome entry side (IRES) and the cDNA of the green fluorescent 
protein (GFP). To obtain pdiTRPS carrying the entire protein coding regions of TRPSb and 
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the GFP (Prasher, D.C. et al. (1992), Gene 111, 229-233), the 5^and 3 '-untranslated sequences 
of the TRP8b cDNA were removed, the consensus sequence for initiation of translation in 
vertebrates (Kozak, M. (1987) Nucleic Acids ResearchlS, 8125-8148) was introduced 
immediately 5" of the translation initiation codon and the resulting cDNA was subcloned into 
the pCAGGS vector (Niwa, H., Yamamura, K. and Miyazald, J (1991), Gene 8, 193-199) 
downstream of the chicken P-actin promotor. The IRES derived from encephalmyocarditis 
virus (Kim, D.G., Kang, H.M., Jang, S.K. and Shin H.S. (1992) Mol.CeU.BioL 12, 3636- 
3643) followed by the GFP cDNA containing a Ser65Thr mutation (Heim, R, Cubitt, A.B., 
Tsien, R.Y. (1995) Nature 373, 663-664) was then cloned 3* to the TRP8b cDNA. The IRES 
sequence allows the simultaneous translation of TRP8b and GFP from one transcript. Thus, 
transfected cells can be detected unequivocally by the development of green fluorescence. 

For monitoring of the intracellular Ca 2+ concentration human embryonic kidney (HEK 293) 
cells were cotransfected with the pcDNA3-TRP8b vector and the pcDNA3-GFPvector in a 
molar ratio of 4 : 1 in the presence of lipofectamine (Quiagen, Hilden, Germany). To obtain 
pcDNA3-TRP8b the entire protein coding region of TRP8b including the consensus sequence 
for initiation of translation in vertebrates (Kozak, M. (1987) Nucleic Acids Researchl5, 8125- 
8148) was subcloned into the pcDNA3 vector (Invitrogen, Groningen, Netherlands). Calcium 
monitoring and patch clamp experiments were carried out two days and one day after 
transfection, respectively. 

(C) Chromosomal localization of the Trp8 gene 

The chromosomal localization of the human TRP8 gene was performed using NIGMS 
somatic hybrid mapping panel No.2 (Coriell Institute, Camden, NJ, USA) previously 
described (Drwinga, H.L., Toji, L.H., Kim, C.H., Greene, A.E., Mulivor, R.A. (1993) 
Genomics 16, 311-314; Dubois, B.L. andNaylor, S.L. (1993) Genomics 16, 315-319). 

(D) In Vitro Translation, glutathione - sepharose and calmodulin agarose binding assay 
N- and C-terminal Trp8-fragments were subcloned into the pGEX-4T2 vector (Amersham 
Pharmacia Europe, Freiburg, Germany) resulting in glutathione-S-transferase (GST)-Trp8 
fusion constructs (Fig. 4). The GST-TRP8-fiision proteins were expressed in E. coli BL 21 
cells and purified using glutathione - sepharose beads (Amersham Pharmacia Biotech Europe, 
Freiburg, Germany). 
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In vitro translation of human Trp8 cDNA and Xenopus laevis calmodulin cDNA (Davis, T.N. 
and Thorner, J. Proc.Natl.Acad.Sci. USA 86, 7909-7913.) was performed in the presence of 
35 S-methionine using the TNT coupled transcription/translation kit (Promega, Madison, 
USA). Translation products were purified by gel fliltration (Sephadex G50, Amersham 
Pharmacia Biotech Europe, Freiburg, Germany) and equal amounts of 35 S labeled probes were 
incubated for 2 h with glutathione beads bound to GST - Trp8 or calmodulin - agarose 
(Calbiochem) in 50 mM Tris-HCl, pH 7.4, 0.1 % Triton X-100, 150 mMNaCl in the presence 
of 1 mM Ca 2+ or 2 mM EGTA. After three washes, bound proteins were eluted with SDS 
sample buffer, fractionated by SDS-PAGE and 35 S labeled proteins were detected using a 
Phosphor Imager (Fujifilm, Tokyo, Japan). 

(£) Calcium measurements 

The intracellular Ca 2+ concentration ([Ca 2+ ]i) was determined by dual wavelength fura-2 
fluorescence ratio measurements (Tsien, R.Y. (1988) Trends Neurosci. 11, 419-424) using a 
digital imaging system (T.LL.L. Photonics, Planegg, Germany). HEK cells were grown in 
minimal essential medium in the presence of 10 % fetal calf serum and cotransfected with the 
pcDNA3-TRP8b vector and the pCDNA3-GFP vector as described above (B). Transfected 
cells were detected by development of green fluorescence. The cells were loaded with 4|iM 
fura-2/AM (Molecular Probes, Oregon, USA) for one hour. After loading the cells were 
rinsed 3 times with buffer Bl (10 mM Hepes, 115 mM NaCl, 2 mM MgCl 2 , 5mM KC1, pH 
7.4) and the [Ca 2+ ]i was calculated from the fluorescence ratios obtained at 340 and 380 nm 
excitation wavelengths as described (Garcia, D.E., Cavalie, A and Lux, H.D. (1994) J. 
Neurosci 14, 545-553). 

(F) Electrophysiological recordings 

HEK cells were transfected with the eucaryotic expression plasmid pdiTRP8 described in (B) 
and electrophysiolocigal recordings were carried out one day after transfection. Single cells 
were voltage clamped in the whole cell mode of the patch clamp technique as described 
(Hamill, O.P., Marty, A., Neher, R, Sakmann, B. and Sigworth, FJ. (1981) Pfliigers Arch. 
391, 85-100; Philipp, S., Cavalie, A., Freichel, M., Wissenbach, U., Zimmer, S., Trost, C, 
Marquart, A, Murakami, M. and Flockerzi, V. (1996) EMBO J. 6166-6171). The pipette 
solution contained contained (mM): 140 aspartic acid, 10 EGTA, 10 NaCl, 1 MgC12, 10 
Hepes (pH 7.2 with CsOH) or 125 CsCl, 10 EGTA, 4 CaCl 2 10 Hepes (pH 7,2 with CsOH). 
The bath solution contained (mM): 100 NaCl, 10 CsCl, 2 MgCl 2 , 50 mannitol, 10 glucose, 20 
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Hepes (pH 7,4 with CsOH) and 2 CaCl 2 , or no added CaCl 2 (-Ca 2+ solution). Divalent free 
bath solution contained (mM): 110 N-methyl-D-glucamine (NMDG). Whole cell currents 
were recorded during 100 msec voltage ramps from -100 to +100 mV at varying holding 
potentials. 

(G) In Situ Hybridization 

In situ hybridizations were carried out using formalin fixed tissue slices of 6 - 8 ^iM thickness. 
The slices were hydrated and incubated in the presence of PBS buffer including 10 jig / ml 
proteinase K (Roche Diagnostics, Mannheim, Germany) for 0.5 h. The slices were hybridized 
at 37°C using biotinylated deoxy-oligonucleotides (0.5 pmol / jil) in the presence of 33 % 
formamide for 12 h. Furthermore the slices were several times rinsed with 2 x SSC and 
incubated at 25°C for 0.5 h with avidin / biotinylated horse raddish peroxidase complex 
(ABC, DAKO, Santa Barbara, USA). After several washes with PBS buffer the slices were 
incubated in the presence of biotinylated tyramid and peroxide (0.15 % w/v) for 10 min, 
rinsed with PBS buffer and additionally incubated with ABC complex for 0.5 h. The slices 
were washed with PBS buffer and incubated in the presence of DAB solution 
(diaminobenzidine (50^ig / ml), 50 mM Tris/EDTA buffer pH 8.4, 0.15 % H 2 0 2 in N,N - 
dimethyl-formamide; Merck, Darmstadt, Germany), The detection was stopped after 4 
minutes by incubating the slides in water. Tyramid was biotinylated by incubating NHS-LC 
Biotin (sulfosuccinimidyl-6-(biotinimid)-hexanoat), 2.5 mg / ml; Pierce, Rockford, USA) and 
tyramin-HCl (0.75 mg / ml, Sigma) in 25 mM borate buffer pH 8.5 for 12 h. The tyramid 
solution was diluted 1 - 5 : 1000 in PBS buffer. 

(H) GenBank accession numbers: TRPSa, Aj243500; TRP8b Aj243501 
Example 2: Expression of TKP8 transcripts 

In search of proteins distantly related to the TRP family of ion channels, an human expressed 
sequence tag (EST, GenBank accession number 1404042) was identified in the GenBank 
database using BLAST programms (at the National Center for Biotechnology Information 
(NCBI); Altschul, S.F., Gish, W., Miller, W., Myers, E.W. and Lipman, D.J.J. (1990) Mol. 
Biol. 5, 403-410) being slightly homologous to the VR1 gene. Several human placenta cDNA 
libraries were constructed and screeened with this EST DNA as probe. Several full length 
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cDNA clones were identified and isolated. The fall length cDNA clones encoded two putative 
proteins differing in three amino acids and were termed TrpSa and TrpSb (Fig. lc, 2a, 7 and 
8A). This finding was reproduced by isolating cDNA clones from two cDNA libraries 
constructed from two individual placentas. The derived protein sequence(s) comprises six 
transmembrane domains, a characteristic overall feature of trp channels and related proteins 
(Fig.: lb). The sequence is closely related to the meanwhile published calcium uptake 
transport protein 1 (CaTl), isolated from rat intestine (Peng, J.B., Chen, X.Z., Berger, U.V., 
Vassilev, P.M., Tsukaguchi, H., Brown, E.M. and Hediger M.A.(1999) J Biol Chem. 6;274, 
22739-22746) and to the epithelial calcium uptake channel (ECaC) isolated from rabbit 
kidney (Hoenderop, J.G., van der Kemp, A. W., Hartog, A., van de Graaf, S.F., van Os, C.H., 
Willems, P.H. and Bindels, R.J. (1999) J Biol Chem. 26;274, 8375-8378). Expression of 
TrpSa/b transcripts are detectable in human placenta, pancreas and prostate (Fig.: 5) and the 
size of the Northern signal (3.0 kb) corresponds with the size of the isolated fall length 
cDNAs. In addition, a shorter transcript of 1.8 kb, probably a splice variant, is detectable in 
human testis. The Trp8 mRNA is not expressed in small intestine or colon (Fig.: 5) 
implicating that Trp8 is not the human ortholog of the rat CaTl or rabbit ECaC proteins. To 
investigate whether there are other related sequences Trp8aft derived primers (UW241, 5'- 
TAT GAG GGT TCA GAC TGC-3' and UW242, 5'-CAA AGT AGA TGA GGT TGC-3') 
were used to amplify a 105 bp fragment from human genomic DNA being 95% identical on 
the nucleotide level to the Trp8 sequence (data not shown). This indicates the existence of 
several similar sequences in humans at least at the genomic level. 

Example 3: Two variants of the Trp8 protein (Trp8a and Trp8b) arise by polymorphism 

Two variants of the Trp8 cDNA were isolated from human placenta (Fig.: 2A, 7 and 8 A) 
which encoded two proteins which differ in three amino acids and were termed Trp8a and 
TrpSb. Trp8a/b specific primers were designed to amplify a DNA fragment of 458 bp of the 
Trp8 gene from genomic DNA isolated from human T-lymphocytes (primer pair: UW243, 5'- 
CAC CAT GTG CTG CAT CTA CC-3' and UW244, 5'-CAA TGA CAG TCA CCA GCT 
CC-3'). The amplification product contains a part of the sequence where the derived protein 
sequence of TrpSa comprises the amino acid valine and the Trp8b sequence methionine as 
well as a silent base pair exchange (g versus a) and an intron of 303bp (Fig.: 2. A, B). Both 
variants of the Trp8 genes (a,b) were amplified from genomic DNA in equal amounts 
indicating the existence of both variants in the human genome and therefore being not the 
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result of RNA editing (Fig.: 2B). The Trp8a gene can be distinguished from the TrpSb gene 
by cutting the genomic fragment of 458bp with the restriction enzyme Bspl286I (Fig. 2B). 
Using human genomic DNA isolated from blood of twelve human subjects as template, the 
458bp fragment was amplified and restricted with BSP1286I. In eleven of the tested subjects 
only the TrpSb gene is detectable, while one subject (7) contains Trp8a and Trp8b genes (Fig.: 
2D). These implicates that the two Trp8 variants arise by polymorphism and do not represent 
individual genes. Using Trp8 specific primers and chromosomal DNA as template, the Trp8 
locus is detectable on chromosome 7 (Fig.: 2C). 

Example 4: Trp8b is a calcium permeable channel 

The protein coding sequence of the Trp8b cDNA was subcloned into pcDNA3 vector 
(Invitrogen, Groningen, Netherlands) under the control of the cytomegalovirus promotor 
(CMV). Human embryonic kidney (HEK 293) cells were cotransfected with the Trp8b 
pcDNA3 construct (pcDNA3-Trp8b vector) and the pcDNA3-GFPvector encoding the green 
fluorescent protein (GFP) in 4:1 ratio. The Trp8b cDNA and the cDNA of the reporter, GFP, 
was transiently expressed in human embryonic kidney (HEK 293) cells. The intracellular Ca 2+ 
concentration ([Ca 2+ ]i) and changes of [Ca 2+ ]j were determined by dual wavelength fura-2 
fluorescence ratio measurements (Fig.: 3F) in cotransfected cells which were identified by the 
green fluorescence of the reporter gene GFP. 

Dual wavelength fiira-2 fluorescence ratio measurement is a standard procedure (e.g. in: An 
introduction of Molecular Neurobiology (ed. Hall, Z.W.)Sinauer Associates, Sunderland, 
USA (1992)) using fiira-2, which is a fluorescent Ca 2+ sensitive dye and which was designed 
by R.Y.Tsien (e.g. Trends Neurosci. 11, 419-424 (1988) based upon the structure of EGTA. 
Its fluorescence emission spectrum is altered by binding to Ca 2+ in the physiological 
concentration range. In the absence of Ca 2+ , fura-2 fluoresces most strongly at an excitation 
wavelength of 385 nm; when it binds Ca 2+ , the most effective excitation wavelength shifts to 
345 nm. This property is used to measure local Ca 2+ concentrations within cells. Cells can be 
loaded with fura-2 esters (e.g. fura-2 AM) that diffuse across cell membranes and are 
hydrolyzed to active fura-2 by cytosolic esterases. 

In the presence of ImM Ca 2+ , Trp8 expressing cells typically contained more than 300 nM 
cytosolic Ca 2+ , while non transfected controls contained less than 100 nM Ca 2+ ions (Fig. 3F). 
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When TrpSb transfected cells were incubated without extracellular Ca 2+ , the intracellular Ca 2+ 
concentration ([Ca 2+ ]i) decreased to levels comparable to non transfected cells. Readdition of 
ImM Ca 2+ to the bath resulted in significant increase of the cytosolic [Ca 2+ ] in TrpSb 
transfected cells, but not in controls (Fig.: 3F). After readdition of Ca 2+ ions to the bath 
solution, the cytosolic Ca 2+ concentration remains on a high steady state level in the Trp8b 
transfected cells. 

Example 5: TrpS expressing cells show calcium selective inward currents 

To characterize in detail the electrophysiological properties of TRPS, TRP8 and GFP were 
coexpressed in HEK293 cells using the dicistronic expression vector pdiTRP8 and measured 
currents using the patch clamp technique in the whole cell mode (Hamill, O.P., Marty, A., 
Neher, E., Sakmann, B. and Sigworth, FJ. (1981) Pflugers Arch., 391, 85-100). 

The eucaryotic expression plasmid pdiTRP8 contains the cDNA of Trp8b under the control of 
the chicken B-actin promoter followed by an internal ribosome entry side (IRES) and the 
cDNA of the green fluorescent protein (GFP). To obtain pdiTRP8 carrying the entire protein 
coding regions of TRP8b and the GFP (Prasher, D.C. et al. (1992), Gene 111, 229-233), the 
5' and 3 '-untranslated sequences of the TRP8b cDNA were removed, the consensus sequence 
for initiation of translation in vertebrates (Kozak, M. (1987) Nucleic Acids Researchl5, 8125- 
8148) was introduced immediately 5* of the translation initiation codon and the resulting 
cDNA was subcloned into the pCAGGS vector (Niwa, H. 5 Yamamura, K. and Miyazaki, J 
(1991), Gene 8, 193-199) downstream of the chicken p-actin promoter. The IRES derived 
from encephalmyocarditis virus (Kim, D.G., Kang, H.M., Jang, S.K. and Shin H.S. (1992) 
Mol.Cell.Biol. 12, 3636-3643) followed by the GFP cDNA containing a Ser65Thr mutation 
(Heim, R., Cubitt, A.B., Tsien, R.Y. (1995) Nature 373, 663-664) was then cloned 3^ to the 
TRP8b cDNA. The IRES sequence allows the simultaneous translation of TRPSb and GFP 
from one transcript. Thus, transfected cells can be detected unequivocally by the development 
of green fluorescence. 

In the presence of 2 mM external calcium, TrpSb transfected HEK cells show inwardly 
rectifying currents, the size of which depends on the level of intracellular calcium and the 
electrochemical driving force. The resting membrane potential was held either at -40 mV, or, 
to lower the driving force for calcium influx in between pulses, at + 70 mV. Current traces 
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were recorded in response to voltage ramps from -100 to +100 mV, that were applied every 
second. To monitor inward and outward currents over time, we analyzed the current size at — 
80 and + 80 mV of the ramps. Figure 3A shows a representative trace of the current at - 80 
mV over time. Both at a holding potential of —40 mV or at +70 mV, the currents are 
significantly larger than in cells transfected with only the GFP containing vector (Fig.: 3E). 
Interestingly, after changing to a positive holding potential, current size in Trp8 transfected 
cells slowly increases and reaches steady state after approximately 70 seconds (Fig.: 3 A). To 
determine the selectivity of the induced currents, we then perfused the cells with solutions that 
either contain no sodium, no added Ca 2+ (Fig. 3 A, C) or a sodium containing, but divalent ion 
free bath solution. To control for the effect of the solution change alone, we also perfused 
with normal bath (see puff in Fig. 3 A). While removal of external Ca 2+ completely abolishes 
the trp 8 induced currents - the remaining current being identical in size and shape to the 
control (Fig.: 3 A, C, E), removal of external sodium has no effect (Fig.: 3E). An important 
hallmark of calcium selective channels (e.g. Vennekens, R., Hoenderop, G.J., Prenen, J., 
Stuiover, M., Willems, PHGM, Droogmans, G., Nilius, B.and Bindels, R.J.M (1999) J. Biol. 
Chem. 275, 3963-3969), is their ability to conduct sodium only if all external divalent ions, 
namely Ca 2+ and magnesium are removed. To test whether the trp 8 channel conforms with 
this phenomenon normal bath solution was switched to a solution containing only sodium and 
1 mM EGTA. As can be seen in Figure 3B and D, Trp8 transfected cells can now conduct 
very large sodium currents. Interestingly, immediately after the solution change, the currents 
first become smaller before increasing rapidly, indicating that the pore may initially still be 
blocked by calcium a phenomenon usually called anomalous mole fraction behaviour 
(Warnat, J., Philipp, S., Zimmer, S.,.Flockerzi, V., and Cavalie A.(1999) J.PhysioL (Lond) 
518, 631-638). The measured outward currents of Trp8 transfected cells in normal bath 
solution are not significantly different from non-transfected control cells or cells which only 
express the reporter gene GFP. As the removal of external Ca 2+ abolishes the Trp8 specific 
current, the remaining current was subtracted from the current before the solution change to 
obtain the uncontaminated Trp8 conductance (see inset in Fig.: 3C). As expected from the 
given ionic conditions (high EGTA inside, 2 mM Ca 2+ outside), the current-voltage 
relationship now shows prominent inward rectification with little to no outward current. 

Both the time course of the development of Trp8 currents and the size of the currents depend 
on the frequency of stimulation (data not shown), the internal and external Ca 2+ concentration 
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and the resting membrane potential, suggesting that Trp8 calcium conductance is intrically 
regulated by a Ca 2+ mediated feedback mechanisms. 

Example 6: Ca 2+ / calmodulin binds to the C-terminus of the Trp8 protein 

To test whether calmodulin, a prime mediator of calcium regulated feedback, is involved, first 
it was investigated biochemically whether Trp8 protein can bind calmodulin. TrpS cDNA was 
in vitro translated in the presence of 35 S-methionine and the product incubated with 
calmodulin-agarose beads. After several washes either in the presence or abscence of Ca 2+ , 
the beads were incubated in Laemmli buffer and subjected to SDS-polyacrylamide gel 
electrophoresis. In the presence of Ca 2+ (ImM), but not in the absence of Ca 2+ , Trp8 protein 
binds to calmodulin (Fig. : 4B). 

To narrow down the binding site, two approaches were undertaken: Firstly, GST-TRP8 
fusion proteins of various intracellular domains of Trp8 were constructed, expressed in E. coli 
and bound to gluthathione sepharose beads. These beads were then incubated with in vitro 
translated 35 S- labeled calmodulin, washed and subjected to gel electrophoresis. Secondly, 
truncated versions of in vitro translated Trp8 protein were used in the above described binding 
to calmodulin-agarose. As shown in Figure 4 A, and C, fusion proteins of the N-terminal 
region (Nl, N2) of TrpS did not bind calmodulin, while C-terminal fragments (CI, C2, C3, 
C4) showed calmodulin binding in the presence of calcium (for localization of fragments 
within the entire TrpS protein see Fig. 4C). Accordingly, a truncated version of in vitro 
translated Trp8, which lacks the C-terminal 32 amino acid residues did not bind to 
calmodulin-agarose (4B). We have restricted the calmodulin binding site to amino acid 
residues 691 to 711 of the Trp8 protein. This calmodulin binding site does not resemble the 
typical conserved IQ - motif of conventional myosins, but has limited sequence homology to 
the calcium dependent calmodulin binding site 1 of the transient receptor potential like (trpl) 
protein of Drosophila melanogaster (Warr and Kelly, 1996) with several charged amino acid 
residues conserved. The sequence of the calmodulin binding site of the TrpS protein 
resembles a putative amphipathic a-helical wheel structure with a charged and a hydrophobic 
site according to a model proposed by Erickson-Vitanen and De Grado (1987, Methods 
Enzymol. 139, 455-478.). 
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Example 7: Expression of TrpS transcripts in human placenta and pancreas 

Several slides from a human placenta of a ten week old abort were used for in situ 
hybridization experiments. The in situ hybridization experiments revealed expression of TrpS 
transcripts in human placenta (Fig.: 5B). Expression was detectable in trophoblasts and 
syncytiotrophoblasts of the placenta, but not in Langhans cells. 

TrpS transcripts are detectable in human pancreas (Fig.: 5 A). Therefore Trp8 probes were 
hybridized to tissue sections of human pancreas. The pancreatic tissues were removed from 
patients with pancreas cancer. Trp8 expression is detectable in pancreatic acinar cells, but not 
in Langerhans islets (Fig.: 5C). No Trp8 expression was found in regions of pancreatic 
carcinomas (data not shown). 

Furthermore, the TrpS cDNA is not detectable in human colon nor in human kidney by in situ 
hybridization as well as by Northern analysis (Fig.: 5 A, D). The Northern results taken 
together with the in situ expression data indicate that the Trp8 protein is not the human 
ortholog of the CaTl and ECaC channels cloned from rat intestine (Peng, J.B., Chen, X.Z., 
Berger, U.V., Vassilev, P.M., Tsukaguchi, H., Brown, E.M. and Hediger M.A.(1999) J Biol 
Chem. 6;274, 22739-22746) and from rabbit kidney (Hoenderop, J.G., van der Kemp, A.W., 
Hartog, A., van de Graaf, S.F., van Os, C.H., Willems, P.H. and Bindels, RJ. (1999) J Biol 
Chem. 26;274, 8375-837S), respectively. Trp8 is unlikely to represent the human version of 
CaTl as its expression is undetectable in the small intestine and colon tissues where CaTl is 
abundantly expressed. If, however, TrpS is the human version of rat CaTl, a second gene 
product appears to be required for Ca 2+ uptake in human small intestine and colon attributed 
to CaTl in rat small intestine and colon. 

Example 8: Differential expression of TrpS transcripts in benign and malign tissue of 
the prostate 

The Trp8 transcripts are expressed in human prostate as shown by hybridization of a Trp8 
probe to a commercial Northern blot (Clontech, Palo Alto, USA) (Fig.: 5 A). Trp8 transcripts 
were not detectable by Northern blot analysis using pooled mRNA of patients with benign 
prostatic hyperplasia (BPH) (Fig.: 5 A, prostate*). To examine Trp8 expression on the cellular 
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level, sections of prostate tissues were hybridized using TrpS specific cDNA probes (Table 3). 
Expression of TrpS transcripts is not detectable in normal prostate (n = 3), benign hyperplasia 
(BPH, n = 15) or prostatic intraepithelial neoplasia (PIN, n = 9) (Fig.: 6A, C, E). TrpS 
transcripts were only detectable in prostate carcinoma (PCA), although with different 
expression levels. Low expression levels were found in primary carcinomas (2 - 10 % of the 
carcinoma cells, n = 8) (Fig.: 7B) . Much stronger expression was detectable in rezidive 
carcinoma (10 - 60 %) (Fig.: 7D, n = 6) and metastases of the prostate (60 - 90 %, n = 4) 
(Fig.: 7F). Thus it has to be concluded that the commercial Northern blot used in Fig.: 5 A 
contains not only normal prostate mRNA as indicated by the distributor. According to the 
distributors instructions the prostate mRNA used for this Northern blot was collected from 15 
human subjects in the range of 14 to 60 years of age. This prostate tissue was not examined 
by pathologic means. Since Trp8 expression is not detectable in normal or benign prostate, 
this finding implicates that the mRNA used for this Northern blot was extracted in part from 
prostatic carcinoma tissue. To summarize, Trp8 expression is only detectable in malign 
prostate and, thus, the Trp8 cDNA is a marker for prostate carcinoma. The results are 
summarized in Table 4. 

Table 3 

Trp8 probes used for in situ hybridization: 
Probes (antisense) 

1. ) 5' TCCGCTGCCGGTTGAGATCTTGCC 3' 

2. ) 5' CTTGCTCCATAGGCAGAGAATTAG 3' 

3. ) 5' ATCCTC AGAGCCCCGGGTGTGGAA3 ' 

Controls (sense) 

1. ) 5' GGCAAGATCTCAACCGGCAGCGGA 3' 

2. ) 5' CTAATTCTCTGCCTATGGAGCAAG 3' 

3. ) 5' TTCCACACCCGGGGCTCTGAGGAT 3' 



Table 4 

Prostate total negative positive 

normal 3 3 0 

BPH 15 15 0 

PIN 9 9 0 
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(B) Differential expression of Trp8 transcripts in benign and malign tissue of the uterus 

Moreover it could be shown that Trp8 is expresed in endometrial cancer (also called cancer of 
the uterus, to be distinguished from uterine sarcoma or cancer of the cervix) whereas no 
expression was observed in normal uterus tissue. Thus, Trp8 also is a specific marker for the 
diagnosis of the above cancer (Fig. 12). 

Example 9: Characterization of Trp9 

The complete protein coding sequence of Trp9 was determined (Fig. 9). Trp 9 transcripts are 
predominantly expressed in the human prostate and in human colon. As it could be shown by 
Northern blot analysis, there is no difference of the expression of TRP9 in benigne prostata 
hyperplasia (BPH, Fig. 13, upper panel left) or prostate carcinoma (Fig. 13, upper panel 
right). However, Trp9 is useful as a reference marker for prostata carcinoma, i.e. can be used 
for quantifying the expression level of TrpS. The ratio of the expression of Trp8:Trp9 in 
patients and healthy individuals is useful for the development of a quantitative assay. 



Example 10: Characterization of TrplO 

The complete protein coding sequence of TRP 10 (a and b) was determined by biocomputing 
(Fig. 10 and 11). Using a 235 bp fragment of the TrplO cDNA as probe in Northern blot 
analysis TRP 10 transcripts could only be detected in mRNA isolated from individuals with 
prostate cancer (Fig. 13, bottom panel) but not in mRNA isolated from benign tissue of the 
prostate (prostate BPH) nor in mRNA isolated from heart, brain, placenta, lung, liver, skeletal 
muscle, kidney and pancreas. The 235 bp cDNA fragment of the TrplO cDNA was amplified 
using the primer pair UW248 5'-ACA GCT GCT GGT CTA TTC C-3' and UW249 5'-TAT 
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GTG CCT TGG TTT GTA CC-3' and prostate cDNA as template. In summary, TrplOa and 
TrplOb , like TRP8 are also expressed in malignant prostate tissue. So far, its expression 
could not be observed in any other tissue examined (see above). Thus, Trp 10a and TrplOb 
are also useful markers which are specific for malignant prostate tissue. 

Furthermore, database searches in public databases of the national center for biological 
information (NCBI) revealed the existence of several expressed sequence tags (EST clones) 
being in part identical to the Trp 10 sequence. These EST clones were originally isolated from 
cancer tissues of lung, placenta, prostate and from melanoma. These clones include the 
clones with the following accession numbers: BE274448, BE408880, BE207083, BE791173, 
AI671853, BE3 90627. The results demonstrate that cancer cells of these tissues express 
TrplO related transcripts whereas no expression of TrplO transcripts in the corresponding 
healthy tissues are detectable (Figure 13). Furthermore, it could be shown that in cancer cells 
of melanoma and prostate cancer TrplO transcripts are expressed as shown by in situ 
hybridizations using 4 antisense probes (Figure 14A - E and 13K-0 and Table 2, above). 
Furthermore, it could clearly be shown that cancer cells of these tissues expressing TrplO 
transcripts also express TrplO-antisense transcripts as shown in Figure 14F-J, Figure 14P-R 
and Figurel4T by in situ hybridizations using 4 sense probes (Table 2, above). The in situ 
hybridization experiments demonstrate that detection of a subset of cancer cells derived from 
carcinoma of lung, placenta, prostate and melanoma is feasible using antisense as well as 
sense probes complementary to TrplO transcripts or complementary to TrplO-antisense 
transcripts, respectively. 

The foregoing is meant to illustrate but not to limit the scope of the invention. The person 
skilled in the art can readily envision and produce further embodiment, based on the above 
teachings, without undue experimentation. 
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What Is claimed Is: 

1. An isolated nucleic acid molecule encoding the human prostate carcinoma associated 
protein TrpSa, Trp8b, Trp9, TrplOa or TrplOb or a protein exhibiting biological properties 
of Trp8a, TrpSb, Trp9, TrplOa or TrplOb and being selected from the group consisting of 

(a) a nucleic acid molecule encoding a protein that comprises the amino acid sequence 
depicted in Figure 7, 8A, 9, 10 or 1 1; 

(b) a nucleic acid molecule comprising the nucleotide sequence depicted in Figure 7, 8A, 
9, 10 or 11; 

(c) a nucleic acid molecule included in DSMZ Deposit No. DSM 13579, DSM 13580, DSM 
13584 , DSM 13581 or DSM....; 

(d) a nucleic acid molecule which hybridizes to a nucleic acid molecule specified in (a) to (c); 

(e) a nucleic acid molecule the nucleic acid sequence of which deviates from the nucleic 
sequences specified in (a) to (d) due to the degeneration of the genetic code; and 

(f) a nucleic acid molecule, which represents a fragment, derivative or allelic variation of a 
nucleic acid sequence specified in (a) to (e). 

2. A recombinant vector containing the nucleic acid molecule of claim 1 

3. The recombinant vector of claim 2 wherein the nucleic acid molecule is operatively linked 
to regulatory elements allowing transcription and synthesis of a translatable RNA in 
prokaryotic and/or eukaryotic host cells. 

4. A recombinant host cell which contains the recombinant vector of claim 3. 

5. The recombinant host cell of claim 4, which is a mammalian cell, a bacterial cell, an insect 
cell or a yeast cell. 

6. An isolated protein exhibiting biological properties of the human prostate carcinoma 
associated protein Trp8a, Trp8b, Trp9, TrplOa or TrplOb which is encoded by a nucleic acid 
molecule of claim 1. 

7. A recombinant host ceil that expresses the isolated protein of claim 6. 
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8. A method of making an isolated protein exhibiting biological properties of the human 
prostate carcinoma associated protein Trp8a, TrpSb, Trp9, TrplOa or TrplOb comprising: 

(a) culturing the recombinant host cell of claim 6 under conditions such that said protein is 
expressed; and 

(b) recovering said protein. 

9. The protein produced by the method of claim 8. 

10. An antisense RNA sequence characterized in that it is complementary to an mRNA 
transcribed from a nucleic acid molecule of claim 1 or a part thereof and can selectively bind 
to said mRNA or part thereof, said sequence being capable of inhibiting the synthesis of the 
protein encoded by said nucleic acid molecule. 

1 1 . A ribozyme characterized in that it is complementary to an mRNA transcribed from a 
nucleic acid molecule of claim 1 or a part thereof and can selectively bind to and cleave said 
mRNA or part thereof, thus inhibiting the synthesis of the protein encoded by said nucleic 
acid molecule. 

12. An inhibitor characterized in that it can suppress the activity of the protein of claim 6. 

13. A method for diagnosing a prostate carcinoma which comprises contacting a target 
sample suspected to contain the protein Trp8a, TrpSb, TrplOa and/or TrplOb or the Trp8a, 
Trp8b, TrplOa and/or TrplOb encoding mRNA with a reagent which reacts with TrpSa, 
Trp8b, TrplOa and/or TrplOb or the TrpSa, Trp8b, TrplOa and/or TrplOb encoding mRNA 
and detecting Trp8a, Trp8b, TrplOa and/or TrplOb or the Trp8a, TrpSb, TrplOa and/or 
TrplOb encoding mRNA. 

14. The method of claim 13, wherein the reagent is a nucleic acid. 

15. The method of claim 13, wherein the reagent is an antibody. 

16. The method of claim 13, wherein the reagent is detectably labeled. 
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17. The method of claim 16, wherein the label is selected from the group consisting of a 
radioisotope, a bioluminescent compound, a chemiluminescent compound, a fluorescent 
compound, a metal chelate, or an enzyme. 

18. A method for diagnosing an endometrial cancer (carcinoma of the uterus) which 
comprises contacting a target sample suspected to contain the protein Trp8a and/or TrpSb or 
the TrpSa and/or Trp8b encoding mRNA with a reagent which reacts with Trp8a and/or Trp8b 
or the TrpSa and/or Trp8a and/or trp8b encoding mRNA and detecting Trp8a and/or Trp8b or 
the Trp8a and/or TrpSb encoding mRNA. 

19. The method of claim 18, wherein the reagent is a nucleic acid. 

20. The method of claim 18, wherein the reagent is an antibody. 

21. The method of claim 18, wherein the reagent is detectably labeled. 

22. The method of claim 21, wherein the label is selected from the group consisting of a 
radioisotope, a bioluminescent compound, a chemiluminescent compound, a fluorescent 
compound, a metal chelate, or an enzyme. 

23. A method for diagnosing a melanoma, chorion carcinoma, cancer of the lung and of the 
prostate in a tissue of a subject, comprising contacting a sample with a reagent which detects 
TrplOa and/or TrplOb antisense RNA or TrplOa and/or TrplOb related antisense RNA. 

24. A method for preventing, treating, or ameliorating a prostate tumor, endometrial cancer 
(carcinoma of the uterus) tumor, a chorion carcinoma, cancer of the lung or melanoma, which 
comprises administering to a mammalian subject a therapeutically effective amount of a 
reagent which decreases or inhibits expression of TrpSa, TrpSb, TrplOa and/or TrplOb and/or 
the activity of Trp8a, Trp8b, TrplOa and/or TrplOb. 

25. The method of claim 24, wherein the reagent is a nucleotide sequence comprising an 
antisense RNA. 
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26. The method of claim 24, wherein the reagent is a nucleotide sequence comprising a 
ribozyme. 



27. The method of claim 24, wherein the reagent is an inhibitor of Trp8a, Trp8b, TrplOa 
and/or Tip 10b. 

28. The method of claim 27, wherein the reagent is an anti-Trp8a-, anti TrpSb-, anti-TrplOa- 
and/or anti-TrplOb antibody or a fragment thereof. 

29. A diagnostic kit useful for the detection of TrpSa, Trp8b, TrplOa and/or TrplOb or Trp8a, 
Trp8b, TrplOa and/or TrplOb encoding mRNA or TrplOa and/or TrplOb antisense transcripts 
in a sample, wherein the presence of an increased concentration of Trp8a, Trp8b, Trp9, 
TrplOa and/or TrplOb or TrpSa, Trp8b, Trp9, TrplOa and/or TrplOb encoding mRNA or 
TrplOa and/or TrplOb antisense transcripts is indicative for a prostate tumor, endometrial 
cancer (cancer of the uterus) tumor, a chorion carcinoma, cancer of the lung or melanoma, 
said kit comprising a probe for detection of Trp8a, TrpSb, Trp9, TrplOa or TrplOb or Trp8a, 
Trp8b, Trp9, TrplOa and/or TrplOb encoding mRNA or TrplOa and/or TrplOb antisense 
transcripts. 

30. The kit of claim 29, wherein the target component to be detected is Trp8a, TrpSb, Trp9, 
TrplOa and/or TrplOb and the probe is an antibody. 

31. A method for identifying a compound which acts as an agonist or antagonist on the ion 
channels TrpS, Trp9 and/or TrplO, said method comprising contacting a test compound 
with the ion channel Trp8, Trp9 and/or Trpl0,and determining whether said test compound 
affects the calcium uptake. 
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Fig. 1C 
C 



htrp8A MG 2 

htrp8B MG 2 

Vrl MEQRASIJKEESESPPQENSCIJJPPDimPNCKPPFVKPHI FTTRSR7RLFGXGDSEEASP 60 

ECaC MG 2 

htrp8A LSL^KEK^ LIIdWSKECRWFQRR E5HAQ5RBEQNLLQQK-BJWESP-LLIA 52 

htrp8B LSLF.KEKG LILCLWSKFOWPQRR ESW7^RgEQNLWJQK-^tWESP-LLIA 52 

Vrl LDCP, YEDSGIASCPI I TVSSVLTI QRPGDGPASVRPSSODSVSAGEKPPRLYDRRS I FDA 120 

ECaC ACPfcKflNS FW«2LCKLLISWPVGE QWEQYR^/NMUXJE-jWRDSP-LL^ 52 

htrp8A AraNDVO^S^WED^VH QI ^3*MliiA& 102 

ECaC AKEND1JUXKILlL12^QSCDFQ QRGAVGESA&iVAA^- YVK— UJ&TLZHEXA 102 

htrpSA PEIVFEFHTSELYEGQiMEiAVW^ 156 

htrpBB PELVFEPMTSEI/fEGSIAUM^VV^ 156 

Vrl RJCTDSUCQFVNASYTDSYYKGCITMfitA^ 240 

ECaC FEIAKEPAIXXPFVGGraBi^V^ 156 

htrp8A -|NLX^E^K^raACVNSEEmitlEHG A^P^QSO^TpVLfir^ILQP 207 

htrp8B -f^I?fFt5|l^FaacWSEE3V^IEHG J^pJjC^ljGNTyiaiglLQP 207 

ECaC -H^I^rap^FjftA^GSEElVRl^IEHG ^IPAQ^^GmvjSlglLQP 207 

htrp8A Kp5^C*TYp LT3TL^d£TEIDSSGDE(^ Lt£l 1 1 TTK-KREAR-QIj&DQT£VK 314 

htrpBB K£tfOT<»n^LT|n^liTE3BSSG^^ 314 

Vrl EX3UiLSSaCFTE^YjSPVH^Spn3isC^^ 420 

ECaC ^HVQ^TC^LT§TLXQLTE^SWGEEXS F?i| LWSS K- KREAft- QI^EQTgVK 314 

htrpSA E|fc/Slii^YGRPY3FCMlJGAl£li£rc^ 374 

htrpSB E^VSLjtCrtKRYGRPY^CMLGAIcnjUi l^CSTOCC I jCB^LKPRTNNRTS^RCNTLIyQQl^LQ 374 

Vrl PjilCD,KWDRFVKRXFYFNFFVYCiY^ LPp YKLKN — 468 

ECaC EiVS FKWKKYGRP Yip/IASL^lt^ CPTTCC Ig^LKLRDDNRTD^RDI TI LQQKLLQ 374 



htrpSA EAY|T PKSDI&LVGELVTVlGAr I IIJ.VEVPDIFRMGVTRFFtKJTILGGPFHVLl ITYAF 434 

htrpBB EAY§T PK^DIRLVGELVTVIGAl 1 1 LLVEVPDI FRMGVTRFFGQTl LGGP FHVLI ITYAF 434 

Vrl ^GfoYEj&TTj^ILSySGGVYFFETlGI QYFLQRRPS — UCSLFVDSYSEILFEVQSL 522 

ECaC EAYV^Q^I^V^VTyTGAVI I LLLEIPDI FRVGASRYFGQTIU5GPFHVI I ITYAS 434 
S2 S3 



htrpSA MVLVTMtTMRLISASQ^^/PM^ 494 

htrpSB MV£VTJ$MRLISAS(Sah/PMSEAm 494 

Vrl FKEVSV^YFSQRKEYVASMV^^ 582 

ECaC LVi, LTMVMRLTNMNGEVVP 1^ EAL\TL&WCSV>Cf ET^Ep^PFTIMIQ^FOTI^lRyC 494 

" ' 5 4 " 



htrpSA WLMAVVI U5FASAFY 1 1 FQTED PEE LB-HFYDYPMALFSTEELV 538 

htrpQB WIMAVVII/3EASAFYIIPQTED PEE LG-HFYDYPMALPSTFELV 538 

Vrl FVYIATFLFGFS TAWTL I EDGKNNS LPMES FPHKCRGSACKPGNS YNSLYSTCIJSLFKFT 642 

ECaC WIWA^ILGEASAFHITFQTED |NN L§-EFSDYPTALFSTgELF 538 

_S5 

htrpBA LTIIitePANYNVj6lJ>n«S";|TYA?^IATtf^^ 598 
htrpBB LTIIDGPANYfJVBLPFMYSlrrYAREAttlATlJ^^ 598 
Vrl IGMGpi£FmJY#F10WFl£LIJAYvfo 702 
ECaC liTI IBGPAOTSV|3LPR$YC^TYAAFA1 U^l^^^Li^f^^^I^HWRVnCpfWElMMQV 598 
S6 

htrpBA VA^TVMIXRKLPRCLWP — RSG 1 CGREYGLGD — FLR^TEDRQDLNRQRI QRYAQA 671 

htrpBB VA5fTVMl£RICLPRCLWP--pSG ICGRE YGLGD — flJfFLRvEDHQnLNRQRI QRYAQA 671 

Vrl AIIILDlteFLKQlPJCAE^SGKI^VpFTPM 762 

ECaC VA^TVMLERKMPRFLWP — RSG ICGYEYGLGg — ^FI^VENHHDQNPLRVLRYVEA 671 

htrpBA FKTR GSEDLDKDSV-EKLELGCPFSPHLSLPl PSVSRSTSRSSANWER1R.QGTLRR 726 

htrpBB FHTR GSEDI^KDSV-EKI£ljfcPFSPHI^Lp|pSVSRST^ 726 

Vrl DPGN-^EGVKRTl^FSIJ^GRVSpRNWKNFAL 820 

ECaC FKCSDKET^QEQLSEKRP-STVESGMl^RASVAFQTPSLSRTTSQSSN — SHRGWEILRR 728 

htrpSA DLRGI INRGLEDGESWE YQI * 746 

htrpBB DLRGI INRGLEDGESWEYQI * 746 

Vrl S LKPEDAEVFKDSMVPGEK * 839 

ECaC NTLGHLNLGLDLGEGDGEEVYHF* 751 
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10 30 50 

GCCAAGTGTAACAAACTCACAGCCCTCTCCAAACTGGCTGGGGCTGCTGGGA.GACTCCCA 

70 90 HO 

AGGAACTCGTCAGGAAGGCAGGAGACAGGAGACGGGACCTCTACAGGGAGACGGTGGGCC 

130 150 170 

GGCCCTTGGGGGGGCTGATGTGGCCCCAAGGCTGAGTCCCGTCAGGGTCTGGCCTCGGCC 

190 210 230 

TCAGGCCCCCAAGGAGCCGGCCCTACACCCCATGGGTTTGTCACTGCCCAAGGAGAAAGG 

MGIiSLPKEKG 
250 270 290 

GCTAATTCTCTGCCTATGGAGCAAGTTCTGCAGATGGTTCCAGAGACGGGAGTCCTGGGC 
LILCLWSKFCRWFQRRESWA 
310 330 350 

CCAGAGCCGAGATGAGCAGAACCTGCTGCAGCAGAAGAGGATCTGGGAGTCTCCTCTCCT 
QSRDEQNLLQQKRIWESPLL 
370 '390 410 

TCTAGCTGCCAAAGATAATGATGTCCAGGCCCTGAACAAGTTGCTCAAGTATGAGGATTG 
L A A K D N' DVQA LNK LL K Y E D C 
430 450 470 

CAAGGTGCACCAGAGAGGAGCCATGGGGGAAACAGCGCTACACATAGCAGCCCTCTATGA 
KVHQRGAMGETALHIAALYD 
490 510 530 

CAACCTGGAGGCCGCCATGGTGCTGArGGAGGCTGCCCCGGAGCTGGTCTTTGAGCCCAT 
N LEAA MVLMEAAPELV FE P M 
550 570 590 

GACATCTGAGCTCTATGAGGGTCAGACTGCACTGCACATCGCTGTTGTGAACCAGAACAT 
TSELYEGQTALHIAVVNQNM 
610 630 650 

GAACCTGGTGCGAGCCCTGCTTGCCCGCAGGGCCAGTGTCTCTGCCAGAGCCACAGGCAC 
NLVRALLARRASVSARATGT 
670 690 710 

TGCCTTCCGCCGTAGTCCCCGCAACCTCATCTACTTTGGGGAGCACCCTTTGTCCTTTGC 
AFRRSPRNLIYFGEHPLSFA 
730 750 770 

TGCCTGTGTGAACAGTGAGGAGATCGTGCGGCTGCTCATTGAGCATGGAGCTGACATCCG 
ACVNSEE IVRLLIEHGADIR 
790 810 830 

GGCCCAGGACTCCCTGGGAAACACAGTGTTACACATCCTCATCCTCCAGCCCAACAAAAC 
AQDSLGNTVLHILILQPNKT 
850 870 890 

CTTTGCCTGCCAGATGTACAACCTGTTGCTGTCCTACGACAGACATGGGGACCACCTGCA 
FACQMYNLLLSYDRHGDHLQ 
910 930 950 

GCCCCTGGACCTCGTGCCCAATCACCAGGGTCTCACCCCTTTCAAGCTGGCTGGAGTGGA 
PLDLVPNHQGLTPFKLAGVE 
970 990 1010 

GGGTAACACTGTGATGTTTCAGCACCTGATGCAGAAGCGGAAGCACACCCAGTGGACGTA 
GNTVMFQHLMQKRKHTQWTY 
1030 1050 1070 

TGGACCACTGACCTCGACTCTCTATGACCTCACAGAGATCGACTCCTCAGGGGATGAGCA 
GPLTSTLYDLTEIDSSGDEQ 

1090 1110 1130 

GTCCCTGCTGGAACTTATCATCACCACCAAGAAGCGGGAGGCTCGCCAGATCCTGGACCA 
SLIiELIITTKKREARQILDQ 
1150 1170 1190 

GACGCCGGTGAAGGAGCTGGTGAGCCTCAAGTGGAAGCGGTACGGGCGGCCGTACTTCTG 
TPVKELVSLKWKRYGRPYFC 

1210 1230 1250 

CATGCTGGGTGCCATATATCTGCTGTACATpATCTGCTTCACCATGTGCTGCATCTACCG 
MLGAIYLLYIICFTMCCIYR 
1270 1290 1310 
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CCCCCTCAAGCCC AGG ACCAAT AACCG C AC AAGCCCCCGGGACAAC AC CCT CTT ACAGC A 
P LKPRTNURTSFRD'NTLLQQ 
1330 1350 1370 

GAAGCTACTTC^.GGAAGCCTACGTGACCCCTAAGGACGATATCCGGCIX^TCGGGGAGCT 
KLLQEAYVTPKDDIRLVGEL 
1390 1410 1430 

GGT GACTGT C ATTGGGGCTATCATCATCCTGCTGGTAG AGGTT CCAGAC AT CTTCAG AAT 
VTVTGAIIILLVEVPDI F R M 
1450 1470 1490 

GGGGGTraCTCGCTTCTTTGGA(^GACCATCXTTGGGGGCCCATTCCATGTCCTCATC^^ 
GVTRFFGQTILGGPFHVL I I 

1510 1530 1S50 

C^CCTATGCCTTCATGGTGCTGGTGACCATGGTGATGC^GCTCATCAGTGCCAGCGGGGA 
TVAFMVLVTMVMRLISASGE 

1570 1590 1€10 

GGT'GGTACCCATGTCCTTTGCACTCGTGCTGGGCTGGTGCAACGTCATGTACTTCGCCCG 
VVPMSFALV'LG WCNVMY FAR 

1630 1650 1670 

AGGATTCCAGATGCTAGGCCCCTTCACCATCATGATTCAGAAGATGATTTTTGGCGACCT 
GFQMLGPFTIMIQKMIFGDL 

1690 1710 1730 

GATGCGATTCTGCTGGCTGATGGCTGTGGTCATCCTGGGCTTTGCTTCAGCCTTCTATAT 
MRFCWLMAVVILGFASAFYI 

1750 1770 1790 

CATCTTCraGACAGAGGACCCCGAGGAGCTAGGCCACTTCTACGACTACCCCATGGCCCT 
I FQTEDPEELGHFYDYPMAL. 
1810 1830 1850 

GTTCAGCACCTTCGAGCTGTTCCTTACCATCATCGATGGCCCAGCCAACTACAACGTGGA 
FSTFELFLTIIDGPANYNVD 
1870 1890 1910 

CCTGCCCTT C ATGTAC^GCATCACCTATGCTGCCTTTGCC AT CAT CGCCAC ACTGCT CAT 
LPFMYS ITYAAFAI IAT LLH 
1930 1950 1970 

GCTCAACCTCCTCATTGCCATGATGGGCGA(^CrraCTGGCGAGTGGCCCATGAGCGGGA 
LNLLIAMMGDTHWRVAHERD 
1990 2010 2030 

TGAG CTGTGGAGGGCCCAGATTGTGGCCACCACGGTGATGCTGG AGCGG AAG CTGC CT CG 
E L WRAQIVATTVMLERKLPR 

20.50 2070 2090 

CTGCCTGTGGCCTCGCTCCGGGATCTGCGGACGGGAGTATGGCCTGGGGGACCGCTGGTT 
CLWPRSGICGREYGLGDRMF 
2110 2130 2150 

CCTGCGGG TGGAAGACAGGCAAGATCTCAACCGGCAGCGGATC CAACG CTACGCACAGGC 
LRVEDRQ DLNRQRIQRYAQA 

2170 2190 2210 

CTTCCACACCCGGGGCTCTGAGGATTTGGACAAAGACTCAGTGGAAAAACTAGAGCTGGG 
FHTRGSEDLDKDS-VEKLELG 

2230 2250 2270 

CTGTCCCTTCAGCCCCCACCTGTCCCTTCCTACGCCCTCAGTGTCTCGAAGTACCTCCCG 
CPFSPHLSLPTPSVSRSTSR 

2290 2310 2330 

CAGCAGTGCCAATTGGGAAAGGCTTCGGCAAGGGACCCTGAGGAGAGACCrGCGTGGGAT 
S SANWERLRQGTLRRDLRGI 

2350 2370 2390 

AATCAACAGGGGTCTGGAGGACGGGGAGAGCTGGGAATATCAGATCTGACTGCGTGTTCT 

INRGLEDGESWEYQI 

2410 2430 2450 

CACTTCGCTTCCTGGAACTTGCTCTCATTTTCCTGGGTGCATCAAACAAAACAAAAACCA 

2470 2490 2510 

AACACCCAGAGGTCTCATCTCCCAGGCCCCAGGGAGAAAGAGGAGTAGCATGAACGCCAA 

2530 2550 2570 

GGAATGTACGTTGAGAATCACTGCTCCAGGCCTCCATTACTCCTTCAGCTCTGGGGCAGA 
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2590 2610 2630 

GGAAGCCCAGCCCAAGCACGGGGCTGGCAGGGCGTGAGGAACTCTCCTGTGGCCTGCTCA 

2650 2670 2690 

TCACCCTTCCGACAGGAGCACTGCATGTCAGAGCACTTTAAAAACAGGCCAGCCTGCTTG 

2710 2730 2750 

GGCCCTCGGTCTCCACCCCAGGGTCATAAGTGGGGAGAGAGCCCTTCCCAGGGCACCCAG 

2770 2790 2810 

GCAGGTGCAGGGAAGTGCAGAGCTTGTGGAAAGCGTGTGAGTGAGGGAGACAGGAACGGC 

2830 2850 2870 

TCTGGGGGTGGGAAGTGGGGCTAGGTCTTGCCAACTCCATCTTCAATAAAGTCGTTTTCG 

2890 2910 
GATCCCTAAAAAAAAAAAAAAAAAAAAAAAAAA 



MGLSLPKEKGLILCLWSKFCRWFQRRESWAQSRDEQNLLQQ 
ALYDNLEAAMVLMEAAPELVFEPMTSEL^ 
VNSEEIVRLLIEHGADIRAQDSliGNTVLHIL 
LMQKRKHTQWTYGPLTSTLYDLTEIDSSGDEQSLLELIIT^ 

MCCI YRPLKPRTNNRTS PRDNTI#LQQKLLQEAYVTPKDDIRLVGELVTVIGAI I ILLVEVPDI FRMGVTRFFGQTIIiGGPFHVLI I 
TYAFMVLVTMVMRL I SASGE WPMS FALVLGWCNVMYFARGFQMLGPFT IMIQKMI FGDLMRFCWLMAVVILGFASAFYI IFQTED 
PEELGHFYDYPMALFST FELFLTI I DGPAN YNVDLPFMYS IT YAAFAI I ATLUtfLNLLI AMMGDTHWRVAHERDELWRAQIVATTV 
MLERKLPRCLWPRSGICGREYGI/3DRWFLRVEDRQDLNRQRIQRYAQ 
SRS SANWERLRQGTLRRDLRG I INRGLE DGESWE YQI 
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Figure 8: 



A v ATGGGTTTGTCACTGCCCAAGGAGAAAGGGCTAATTCTCT 

MGLSLPKEKGLILC 

250 270 290 

GCCTATGGAGCAAGTTCTGCAGATGGTTCCAGAGACGGGAGTCCTGGGCCCAGAGCCGAG 
LWSKFCRWFQRRESWAQSRD 

310 330 350 

ATGAGCAGAACCTGCTGCAGC^GAAGAGGATCTGGGAGTCTCCTCTCCTTCTAGCTGCCA 
EQNLLQQKRIWESPLLLAAK 

370 390 410 

AAGATAATGATGTCCAGGCCCTGAACAAGTTGCTCAAGTATGAGGATTGCAAGGTGCACC 
DNDVQALNKLLKYEDCKVHQ 

430 450 470 

AGAGAGGAGCCATGGGGGAAACAGCGCTACACATAGCAGCCCTCTATGACAACCTGGAGG 
RGAMGETALHIAALYDNLEA 

490 510 530 

CCGCCATGGTGCTGATGGAGGCTGCCCCGGAGCTGGTCTTTGAGCCCATGACATCTGAGC 
AMVLMEAAPELVFEPMTSEL 

550 570 590 

TCTATGAGGGTCAGACTGCACTGCACATCGCTGTTGTGAACCAGAACATGAACCTGGTGC 

YEGQTALHIAVVNQNMNLVR 

610 630 650 

GAGCCCTGCTTGCCCGCAGGGCCAGTGTCTCTGCCAGAGCCACAGGCACTGCCTTCCGCC 
ALLARRASVSARATGTAFRR 

670 690 710 

GTAGTCCCTGCAACCTCATCTACTTTGGGGAGCACCCTTTGTCCTTTGCTGCCTGTGTGA 
SPCNLIYFGEHPLSFAACVN 
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Fig. 8 / continue -n 1 

730 750 770 

ACAGTGAGGAGATCGTGCGGCTGCTCATTGAGCATGGAGCTGACATCCGGGCCCAGGACT 
SEEIVRLLIEHGADIRAQDS 
790 810 830 

CCCTGGGAAACACAGTGrTACACATCCTCATCCTCCAGCCCAACAAAACCTTTGCCTGCC 
LGNTVLHILILQPNKTFACQ 
850 870 890 

AGArGTACAACCTGTTGCTGTCCTACGACAGACATGGGGACCACCTGCAGCCCCTGGACC 
MYNLL1*SYDRHGDHLQPLDL 
910 930 950 

TCGTGCCCAATCACCAGGGTCTCACCCCTTTCAAGCTGGCTGGAGTGGAGGGTAACACTG 
VPNHQGLTPFKLAGVEGNTV 
970 990 1010 

TGATGTTTCAGCACCTGATGCAGAAGCGGAAGCACACCCAGTGGACGTATGGACCACTGA 
MFQHLMQKRKHTQWTYGPLT 
1030 1050 1070 

CCTCGACTCTCTATGACCTC^CAGAGATCGACrcCTCAGGGGATGAGCAGTCCCTGCTGG 
ST LYDLTEIDSSGDEQSLLE 
1090 1110 1130 

AACTTATCATCACCACCAAGAAGCGGGAGGCTCGCCAGATCCTGGACCAGACGCCGGTGA 
LI ITTKKREARQ ILDQTPVK 
1150 1170 - 1190 

AGGAGCTGGTGAGCCTCAAGTGGAAGCGGTACGGGCGGCCGTACTTCTGCATGCTGGGTG 
ELVSIrKWKRYGRFYFCM LGA 
1210 1230 1250 

CCAT ATATCTGCTGT ACATCAT CTGCTT C ACC ATGT G CTGCATCT ACCG CCCCCT C AAGC 
IYLLYIICFTMCCIYRPLKP 
1270 1290 1310 

CCAGGACCAATAACCGCACGAGCCCCCGGGACAACACCCTCTTACAGC^GAAGCTACTTC 
RTNNRTS PRDNTLLQQKLLQ 
1330 1350 1370 

AGGAAGCCT ACATG ACCCCT AAGGACGAT AT CCGG CTGGTCGGGG AGCT GGTGACTGT C A 
EAYMTPKDDIRLVGELVTVI 
1390 1410 1430 

TTGGGGCTATC AT CATCCTGCTGGTAGAGGTT CCAGACATCTTC AGAAT GGGGGT CACTC 
G A I I ILLVEV PDIFRMGVTR 
1450 1470 1490 

GCl'TCTTTGGACAGACCATCCTTGGGGGCCCATrCCATGTCCTCATCATCACCTATGCCT 
FFGQT ILGGP F H V L I I T Y A F 
1510 1530 1550 

TCATGGTGCTGGTGACCATGGTGATGCGGCTCATCAGTGCCAGCGGGGAGGTGGTACCCA 
MVLVTMVMRLI SASGEVVPM 
1570 1590 1610 

TGTCCTOTGCACTCGTGCTGGGCTGGTGCAACGTCATGTACTTCGCCCGAGGATTCCAGA 
S FALVLGWCNVMYFARGFQM 
1630 1650 1670 

TGCT AGGCCCCTTCAC C ATC ATGATTC AG AAGAT G ATTTTTGG CGACCTGATGCGATTCT 
IiGPFTIMIQKK IFGDLMRFC 
1690 1710 1730 

GCTGGCTGATGGCTGTGCTCATCCTGGGCTTTGCTTCAGCCTTCTATATCATCTTCCAGA 
WLMAVVI LG FASAFYI IFQT 
1750 1770 1790 

CAGAGGACCCCGAGGAGCTAGGCCACTTCTACGACTACCCCATGGCCCTGrrCAGCACCT 
EDPEELGHFYDYPMALFSTF 
1810 1830 1850 

TCG AGCTG TT CCT T ACCAT CAT CGATGGCCCAG CC AACTACAACGTGGACCTGC CCTTC A 
ELFLTI IDGPANYNVDLPFM 
1870 1890 1910 

TGTACAGCATCACCTATGCTGCCTTTGCCATCATCGCCACACTGCTCAT5CTCAACCTCC 
YSITYAAFAI IATLLMLNIiL 
1930 1950 1970 

TCATTGCCATGATGGGCGACACTCACTGGCGAGTGGCCCATGAGCGGGATGAGCTGTGGA 
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Fig. 8 / continui on 2 
iammgdt hwrvahe rdelwr 

1990 2010 2030 

GGGCCCAGATTGTGGCCACCACGGTGATGCTGGAGCGGAAGCTGCCTCGCTGCCTGTGGC 
AQIVATTVMLER KLPRCLWP 

2050 2070 2090 

CrCGCTCCGGGATCTGCGGACGGGAGTATGGCCTGGGAGACCGCTGGTTCCTGCGGGTGG 
RSGICGREYGLGDRWFLRVE 

2110 2130 2150 

AAGACAGGCAAGATCTCAACCGGCAGCGGATCCAACGCTACGCACAGGCCTTCCACACCC 
DRQDLNRQRIQRYAQAFHTR 

2170 2190 2210 

GGGGCTCTGAGGATTTGGACAAAGACTCAGTGGAAAAACTAGAGCTGGGCTGTCCCTTCA 
GSEDLDKDSVEKLELGCPFS 
2230 2250 2270 

GCCCCC ACCTGTCCCTTCCT ATG CCCTC AGT GTCTCGAAGTAC CTCCCGCAGCAGTGCCA 
PHLSLPMPSVSRSTSRSSAN 

2290 2310 2330 

ATTGGGAAAGGCTTCGGCAAGGGACCCTGAGGAGAGACCTGCGTGGGATAATCAACAGGG 
WERLROGTLRRDLRGIINRG 
2350 2370 2390 

GTCTGGAGGACGGGGAGAGCTGGGAATATCAGATCTGA 
LEDGESWEYQI * 



MGL S LPKEKG LI LCLW SKFCRW FQRRESWAQSRDEQN LLQQKRI WE S PLLLAAKDNDVQ ALNKLLKYE DCKVHQRG AMGET ALH I A 
ALYDNIJS21AMVLMEAAPELVET5PMTSELYEGQTA 

VNSEEIVRLLIEHGADIRAQDSLGNTVLHILILQPNKTFACQMYNLLI^ 

LMQKRKHTQWT YGPLTSTLYDLTEIDSSGDEQSLLEL I ITTKKREARQI LDQTPVKELVSLKWKRYGRPYFCMIX3AI YLLY 1 1 CFT 
MCCI YRPLKPRTNNRTS PRDNTLLQQKLLQEAYMT PKDDIRLVGELVTVIGAI I ILLVEV PDI FRMGVTRFFGQTI LGG PFHVLI I 
T Y AFMVLVTMVMRLI S ASGE W PMSFALVLGW CNVMY FARGFQMLG PFT I M I QKM I FG DLMRFCW LMAW I LG FAS AFY 1 1 FQTE D 
PEELGHFYDYPMALFSTFELFLTIIDGPANYtTVDLPFW^ 

MLERKLPRCLWPRSGICXSREYGLGDRWFIJIVEDRQDLNRC^IQRYAQAFHTRGSEDLD^ 
SRSSANWERLROGTLRRDLRGIINRGLEDGESWEYQI 



CAAACTC^CAGCCCTCTCCAAACTGGCTGGGGCTGCT^ 
CCTCTACAGGGAGACGGTGGGCCGGCCCTTGGGGGGGCTGATGT^^ 

GGCCCCCAAGGAGCCGGCCCTACACCCC^TGGGTTTGTCACTGCCCAAGGAGAAAGGGCTAATTCTCTGCCT 

GCAGATGGTTCCAGAGACGGGAGTCCTGGGCCCAGAGCCGAGATGAGC^GAACCTGCTGCAGCAGAAGAGGATCTGGGAGTCTCCT 
CTCCTTCTAGCTGCCAAAGATAATGATGTCCAGGC^^ 

CATGGGGGAAAC^GCGCTACACATAGCAGCCCTCTATGACAACCTGGAGGCCGCCATGGTGCTGATGGAGGCTGCCCCGGAGCTG^ 

TCTTTGAGCCCATGACATCTGAGCTCTATGAGGGTCAGACTGC^^ 

GCCXTGCTTGCCTOCAGGGCCAGTGTCTCTGCCAGAGC^^ 

GGAGraCCCTTTGTCCTTTGCTGCCTGTGTGAAC^GTGAGGAGATCGTCCGGCTG 

AGG ACT GC CTGGCCCAACAAAACCTTTGCCT GCC AG AT GT AC AACCTGT TG CTGT C CT ACGAC AG ACATGGGG ACC ACCTGCAGCC 

CCTGGACCTCGTGCCCAATCACCAGGGTCTCACCCCTTTCAAGCTGGCTGGAGTGGAGGGTAACACTGTGATGOT 

TGCAGAAG CGG AAGC AC ACCCAGTGG ACGTATGGACCACTGACCTCGACTCTCT ATG ACCT C AC AGAGAT CGACTCCT CAGGGGAT 

G AGCAGT CCCTG CTGG AACTT AT CAT CACC ACC AAGAAGCGGG AGGCTCGC CAGAT CCTGG ACC AGACGCCGGTG AAGG AGCTGGT 

GAGCCTCAAGTGGAAGCGGTACGGGCGGCCGTACTTCTGCATGCrGGGTGCCATATATCTGCTGTACATCATCTGCTTCACCATGT 

GCTGCAT CTACCGCC CCCTCAAGCCCAGG ACC AAT AACCG C ACAAGCCCCCGGGACAACACC CT CTTACAGC AGAAGCT ACTTC AG 
GAAGCCTACGTGACCCCTAAGGACGATATCCGGCTGGTCGGGGAGCTGGTGACTGTCATTGGGGCTAT(^TCATCCTGCTGGTAGA 

GGTTCCAGACATCTTCAGAATGGGGGTCACTCGCTTCTTT 
ATGCCTTCATGGTGCTGGTGAIXATGGTGATGCGGCTCA^ 

T C AT CCT GGG CT TT GCT T CAG CCTTCT AT ATCATCTTCCAG ACAGAGGACC CCGAG GAGCT AGGCCACTTCTACG ACT ACCCCATG 
GCCCTGTTC^GCACCTTCGAGCTGGTCCTTACCATCATCGATGGCCCAGCCAACTACAACGTGGACCTGCCCrTCATGTACAGCAT 
CACCTATGCTGCCTTTGCCATCATCGCCACACTGCTCATGCTCAACCTCCTCATTGCCATGATGGGCGACACTCACTGGCGAGTGG 
CCCATGAGCGGGATGAGCTGTGGAGGGCCCAGATTGTGGCCACCACGGTGATGCTGGAGCGGAAGCTGCCTCGCTGCCTGTGGCCT 
CGCTCCGGGATCTGCGGACGGGAGTATGGCCTGGQGGACCGCTGGOTCCTGCGGG^ 
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GATCCAACGCTACGCACAGGCCTTCCACACCCGGGGCTCTGAGGATTTGGACAAAGACTCAGTGGAAAAACTAGAGCTGGGCTGTC 
CCTTCAGCCX^CCACCTGTCCCTTCCTACGCCCTCAGTGTCTCGAAGTACCTCCCGCAGCAjSTGCCAATTGGG^AAGGCTTCGGCAA 
GGGACCCTGAGG AG AG ACCTG CGTGGG AT AAT C AACAGGGGTCTGG AGG ACGGGG AG AGCT GGG AATATC AGAT CTGACTGCGTGT 
TCTCACTTCGCTTCCTGGAACTTGCTCTCATTTTC 

AGGCCCC&GGGAGAAAGAGGAGTAGCATGAACGCCAAGGAATGTACGTTGAGAATC^^ 

TCTGGGGCAGAGGAAGCCCAGCCCAAGCACGGGGCTGGCAGGGC^ 

GAGCACTGC&TGTCAGAGCTCTTTAAAAACAGGCCAGCCTGCTTGGGCCCTCGCT 

CCCTTCCCAGGGCACCCAGGCAGGTGCAGGGAAGTGCAGAGCrTGTGGAAAGCGTGTGAGTGAGGGAGACAGGAAC 
GTGGGAAGTGGGGCTAGGTCTTGCOUVCTCCATCTT^ 



CAAACT C AC^GCCCTCTC C AAACTGG CTGGGGCTGCTGGG AG ACTCCC AAGGAACTCGTCAGG AAG GCAGGAGACAGG AG ACGGGA 

CCTCTACA£^GAGACGGTGGGCCGGCCCTTGGGGGGGCTGATGTGGCCCO^ 

GGCCCCCAAGGAGCCGGCCCTACACCCC^TGGGTTTGTC^CTGCCCAAGGAGAAAGGGCT 

G C AG ATGGTTCC AGAG ACGGGAGT CCTGGGCCC AG AGCCG AGATG AGCAGAACCTGCTGC AG C AG AAG AGG AT CTGGG AGT CTCCT 
CTCCTTCTAGCTGCCAAAGATAATGAXt^CCAGGCCCTGAACAAGT 

CATGGGGGAAACAGCGCTACAC AT AGCAG CCCTCT ATGAC AACCTGGAGGCCGCCATGGT GCTGATGGAGG CTGCC CCGGAGCTGG 
TCTTTGAGCCCATGACATCTGAGCTCTATGAGGTCCTGACTGCCCA^ ' 
AAGAGGAAGAGATGGGCAGCAGCTGG&TCCCCTGGGAATCCTGAACACCCX5AGA 

GGGAAAGAGACTGGGGTGCATATGGGAGGGACCCCCTGCAGGATCC?rGGGGACAGACCCGTGACTGACAGCTGTCTCT 

TCAGACTGC^CTGCACATCGCTGTTGTGAACCAGAACATGAACCTGGTGre^ 

GAGC(^CAGGCACTGCCTTCCGOCGTAGT(XX:TGCAACCTCATCr^^ 

AGT G AGG AG ATCGTGCGG CT GCTCATT G AGCATGGAGCTGACATCCGGGCCC AGGACTCCCT G GCCCAAC AAAACCTTTGCCTGCC 
AGATGTACAACCTGTTGCTGTCCTACGAC^GACATGGGGACCACCTGC^ 

CCTTTCAAGCTGGCTGGAGTGGAGGGTAACACTGTGATGTTTC^GCACCTGATGCAGAAGCGGAAGCACACCCAGTGGACGTATGG 
AC CACT G ACCT CG ACT CT CT ATGAC CT CACAGAG AT CG ACT C CT CAGGG G AT GAGC AGTCCCTG CTGG AACT TAT CMC AC C ACCA 
AGAAGCGGGAGGCTCGCCAGATCCTGGACCAGACGCCGGTGAAGGAGCTGGTGAGCCrn^^GTGGAAGCGGTACGGGCGGCC^ 
TTCTGCAlt^TGGGTGCCATATATCTGCTGTAC^TCATCTGCTTCA^ 

TAACCGCACGAGCCCCCGGGACAACACCCTCTTACAGCAGAAGCTACTTCAGGAAGCCTACATGACCCCrAAGGACGATATCC 
TGGTCGGGGAGCTGGTGACTGT CATTGGGGCTATCAT CATCCTGCTGGTAGAGGTTCCAGACATCTT CAGAATGGGGGTCACTCGC 
TT CTTTGG AC AGACC ATCCTT GGGGGCCC ATTCCATGTCCTCATCATC ACCT ATGCCTT CATGGTGCTGGTGACCATGGTG ATGCG 
GCTCATC AGT GCCAGCGGGG AGGTGGTACCC ATGT CCTTTGCACTCGTG CTGGG CTGGT GCAACGTCATGT ACTT CGCCCGAGG AT 
TCCAGATGCTAGGCCCCTTCACCATCATGATTCAGAAGATGATTTTTGGCGACCTGATGCGATTCTGCTGGCTGATGGCTGTGGTC 
ATCCTGGGCTTT GCTT AGAC AGAGGACCCCG AGG AGCTAG GCCACTTCT ACG ACT ACCC CACCTTCGAGCT 
GGTCCTTACCATCATCGATGGCCCAGCCAACTACAACGTGGACCTGCCCT 
TCGCCACACTGCTCATGCTCAACCTCCTCATTGCCATGATGGGCGACACTCACTGGC 

AGGG CCCAG ATTGTGGCC AC C ACGGTGATGCT GGAGCGG AAGCTG CCTCGCTGCCTGTGGCCT CGCT CCGGGATCTQCGGACGGG A 
GTATGGCCTGGGAGACCG CTGGTTCCTGCGG GTGGAAGACAGGCAAGATCTCAACCGGCAG CGGAT CCAACGCTACGCACAGGCCT 
TCCACACCCGGGGCTCTGAGGATTTGGACJU\AGACTCAGTGGA 

CCT ATG CCCTCAGT GTCT CGAAGT ACCTC CCGCAGCAGTGCCS^TTGGG AAAGGC!TT CXK3CAAG 
T GGGATAATC&ACAGGGGTCTGGAGGACGGGGAGAGCTGGGAATATCAGATCTGACTGCG 
GCTCTCATTTTCCTGGGTGCATCAAACAAAACAAAAACO\AACACCCAGAGGTCTCATCTCCCA 

AGC ATGAACG CCAAGG AATGT ACGTT GAG AAT C ACTGCTCXIAGGCCTGCATT ACTCCTT CAGCTCTGGGGCAG AGGAAGCCC AGCC 
CJ^GC ACGGGGCTGGC^GGGCGTGAGGAACTCTCCTG TGGCCTGCT CACT GCATGT CAGAG C ACTT 

TAAAAAraGGCCAGCCTGCTTGGGCCCTCGGTCTC(^C<XC^^ 

GTGCAGGGAAGTGCAGAGCTT GT GGAAAGCGTGTG AGTGAGGG AGACAGGAACGGCT CT GGGGGTGGG AAGTGGGGCTAGGTCTTG 
CCAACTCCATCTTCAATAAAGTCGTTTTCGGATCCCTAAAAAAAAAAAAAAAAAAAAAAAAAA 



CAAACTCACAGCCCTCTCCAAACTGGCTGGGGCTGCTGGGAGACTCCCAAGGAACTCGTCAGGAAGGCAGGAGACAGGAGACGGGA 
CCTCTACAGGGAGACGGTGGGCCGGCCCTTGGGGGGGCTGATGTGGCCCCAAGGC^ 

GGCCCCCAAGGAGCCGGCCCTACACCCCATGGGTTTGTCACTGCCCAAGGAGAAAGGGCTAATTCTCTGCCTATGGAGCAAGTTCT 

GCAGATGGTTCCXGAGACGGGAGTCCTGGGCCC^GAGCCGAGATGAGCAGAACC^^ 

CTC<nTCTAGCTGCCAAAGATAJiTGATGTCCAGGCCCTGAACAAGTTGCTCAA 

CATGGGGGAAACAGCGCTACACATAGCAGCCCTCTATGACAACCTGGAGGCCGCC^TGGTGCrrGATGGAGGCTGCCCCGGAGCTGG 
TCTTTGAGCCCATGACATCTGAGCTCTATGAGGGTCAGACTGCACTGCACTVTCGCTGTTGTGAACCAGAACATGAACCTGGTGaSA 
GCCCTGCTTGCCCGCAGGGCCAGTGTCTCTGCCAGAGCCAC^GGCACTGCCrrC^ 
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AAAC AC AGTGT T AC AC AT CCT CAT C CTCCAGCCC AACAAAACCT TTGCCT GCCAGATGT AC AACCTGTTG CTGT C CT ACG ACAG AC 
ATGGGG ACCAC C^GCAGCCCCT GGACCTCGTGCCC AAT CACCAGGGTCT C ACCCCTTTC AAGCT GG CTGGAGTGG AGGGT AAC ACT 
GTGATGTTTC AGCACCTGATGC AG AAGCGG AAGCAC ACCCAG TGGACG TATGGACCACTG ACCT CG ACTC TCT ATGACCTCACAGA 
GATCGACTCCT C AGGGGATGAGC AGTCCCTGCTGG AACTT AT CAT CACCACC AAG AAGCGGGAGGCT CGCC AG AT CCTGG ACCAG A 
CGCCGGTGAAGGAGCTGGTGAGCCTCAAGTGGAAGCGGTACGGGCGGCCGTACTTCTGCMGCTG 
AT CATCTGCTT C^CCATGTGCTGCATCTACCGCCCXCTCAAGCCXIAGGACCAAT 
ACAGmGAAGCTACTTCAGGAAGCCTACGTGACCCCT^ 

TCATCATCCTGCTGGTAGAGGTTCCAGAC^TCTTa^GAATGGGGGTCACTCGCTTCTTTGGAC^ 
C^TGTCCTC^TCATCACCTATGCCTTCATGGTGCTGGTGACCA 

GTCCTTTGC ACTCGTGCTGGGCTGGTGCAAQ^CATGTACTTCGCCCGAG GATTCCAG ATG CT AGGCCCCTT CACCAT CATGATT C 
AGAAGATGATTTTTGGCGACCTGATGCGATTCTGCTGG CTGATGGCTGTG GTCATCCTGGGCTTTG CTT C AG CCTTCT AT ATCATC 
TTCCAGACAGAGGACCCCGAGGAGCTAGGCCACTTCTACGACTACCCCT^ 
CATCGATGGCCCAGCCAACTACAACGTGGACCTGCCCTTC^TGTACAGCAT 

TCATGCTC3UVCCT CCTCATTGCCATGATGGGCG AC ACT CACTGGCG AGTGGCCCATG AGCGGGATGAGCTGTCGAGGGCCCAGATT 
GTGGCCACCACGGTGATGCTGGAGCGGAAGCTGCCTCGCTC^ 

GGACCGCTGGTTCCTGCGGGTGGAAGACAGGCAAGATCTCAACCGGCAGCGGATCCA 

GCT CTGAGGATTTGGACAAAGACT CAGTGGAAAAACTAGAGCT GGGCTGT CCCTT C AGCCCCC ACCTGTCCCTT CCTACGCCCTCA 

GTGTCTCGAAGTACCTCCCGCAGC^GTGCCAATTGGGAAAGGCTTCGGCA 

CAGGGGTCTGGAGGACGGGGAGAGCTGGGAATATC^GATCTGACTGCGTGTTCTCACTTC 

CTGGGTGCAT CAAACAAAACAAAAACC AAACACCCAGAGGTOTCATCTCCCAGG CC^CCAGGGAGAAAGA ' 
AAGGAATGTACGTTGAGAATCACTGCTCaGGCCTGCATTACTCCT^ 
TGGCAGGGCGTGAGGAACTCTCCTGTGGCCTGCTCATCACCCTTCCGAC^^ 
AGCCTGCTTGGGCCCTCGGTCTCCACCC^GGCTraTAAGTGGGG 

GCAGAGCTTGTGGAAAGCGTGTGAGTGAGGGAGACAGGAACGGCTCTGGGGGTGGG^^ 
TCAATAAAGTCGTTTTCGGATCCCTAAAAAAAAAAAAAAAAAAAAAAAAAA 



'■) 

CACACATGGGGCCT CCCAGG AGTGCCC AGGACCT CGTGCTCTTGGCCTCTGAATCT ATQ7T CTCCAATCCGCTGTCCCACAGAAGC 
CATATAACCCACCTCTCTGTAAATGCCAGGAGCCATGG^ 

CCATGGTGCTGATGGAGGCTGCCCCGGAGCTGGTCrTTGAGCCC^GACAT^ 

GGGTGAAG AG CAGGAGTGACGTGGTTGGGTATT CAAGTCAGTCTCTGTG ATGGAT AATTTGGGAAAGACACAGGGGAT CTGAGCCT 
CCTACTCTTTTTSTCTTCTCTGTCTCCCTTCCG^ 

G(TCTAGAGAAGAGGAAGAGATG<^CAGCAGCTGG^TCCCCTGGGAATCCTG^ 

ATCCCTGAGGGAAAGAGACTAX^GTGCATATGGGAGGGAC^^ 

CTGGGCCAGGTCAGACTGCACrrGCACATCGCTGTTGTC 

GTCTCTGCCAGAGCCACAGGCACTGCCTTCCGCCCTAGTCCCTGC^ 

CTGTGTGAACAGTGAGGAGAXCGTGCGGCTGCTCATTGAGCATGGAGCT 

TTGCTGTCCTACGACAGA(^TGGGGACCACCTGCAGCCCCTGGACCTCGTGCCCAATCACCAGGGTCT 

TGGAGTGGAGGGTAACACTCTGATGTTTCAGCACCTGATGCAGAAGCGGAAGCA 

CTCTCTATGACCTCACAGAGATCGACTCCTC^GGGGATGAGCAGTCCC^ 

CGCCAGATCCTGGACC AG ACG CCGGTGAAGGAGCTGGTGAG CCT CAAGTGG AAGCGGT ACGGGCGG CCGT ACT T CT GCATGCTGGG 

TGCCATATATCTGCTGTACATCATCTGCTTCACCATGTGCTGCATCT 

CCCGGGACSUICACCCTCTTACAGCAGAAGCTACTTCAGGAAGCCTACATGACC^ 

GTGACTGTCATTGGGGCTATCATCATCCTGCTGGTAGAGGTTCCAGACATCT 

C^TCCTTGGGGGCCCATTCCATGTCCTCATC^TCACCTA^ 

GCGGGGAGGTGGTACCCATGT CCTTTGCACTCG TGCTGGGCTGGTGCAACGTCATGTACT TCGCCCGAGGATT C CAG ATGCTAGGC 
CCCTTCAC CATC ATG ATT C AG AAG ATG ATTTTTGGCG ACCTG ATGCG ATTCTG CTGGCTG AT GG CTGTGGT CAT CCTGGGCTTTGC 
T T CAGCCTT CTATAT CAT CTT CCTGACAGAGGACCCCGAGGAGCTAGGCCACTTCTACG ACT ACCCCATGG CCCTGTT(^GC ACCT 
TCGAGCTGGrCCTTACCATCATCGATGGCCCAGCCAACTACAACGTGGACCTGCCCTTCATGTACAGCATCACCTATGCTGCCTTT 
GCmTCATCGCCACACTGCTCATGCTC^CCTCCTCATTGCCATGATGGGCGAC^CTCACTGGCGAGTGGCCC^TGAGa3GGATGA 
GCTGTGGAGGGCCCAGATTGTGGCCACCACGGTGATGCTGGAGCGGAAGCTGCCTCGCTGCCTGTGGCCTCGCTCCGGGATCTGCG 
GACGGG AGTATGGCCTGGG AGACCG CTGGTTCCTG CGGG TGGAAG ACAGGCAAGAT CTCAACCGGCAGCGG ATC CAACGCTACG CA 
CAGGCCTTCCACACCCGGGGCTCTGAGGATTTGGACAAAGACTCAGTGGAAAAACTAGAGCTGGGCTGTCCCTTCAGCCCCmCCT 
GTCCCTTCCTATGCCCTCAGTGTCTCGAAGTACCTCCCGCAGCIAGTGCC^^TTGGGAAAGGCTTCGGCAAGGGACCCTGAGGAGAG 
ACCTGCGTGGGATAATC^CAGGGGTCTGGAGGACGGGGAGAGCTGGGAAT^^ 

GGAACTTGCTCTCATTTTCCTGGGTGCATCAAACA7UVACA7^AAAC<^AAACACCCAGAGGTCTCATCTCCCAGGCCCCAGGGAGA^ 
GAGGAGTAGCATGAACGCCAAGGAATGTACGTTGAGAATCACTGCTCtoGGCCTGCATTACTCCTTCAGCTCTGGGGCAGAGGAAG 
CCCAGCCCAAGCACGGGGCTGGCAGGGCGTGAGGAACTCTCCTGTGGCCTGCTCATCACCCTTCCGACAGGAGCACTGCATGTCAG 
AGCACTTTAAAAACAGGCCAGCCTGCTTGGGCCCTCGGTCTCCACCCCAGGGTCATAAGTGGGGAGAGAGCCCTTCCCAGGGCACC 
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CAGGCAGGTGCAGGGAAGTGCAGAGCTTGTGGAAAGCGTGTGACTGAGGGAGACAGGAACGGCTCTGGGGGTGGGAAGTGGGGCTR 
GGTCTTGCCAACTCCATCTTCAAT&AAGTCGTTTTC 
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Figure 9: 
A. 

10 30 50 

CGGGGCCCTGGGCTGCAGGAGGTTGCGGCGGCCGCGGCAGCATGGTGGTGCCGGAGAAGG 

M V V P E K E 
70 90 110 

AGCAGAGCTGGATCCCCAAGATCTTCAAGAAGAAGACCTGCACGACGTTCATAGTTGACT 
QSW 1 PKt FKKKT CTTFIVDS 
130 150 170 

CCACAGATCCGGGAGGGACCTTGTGCCAGTGTGGGCGCCCCCGGACCGCCCACCCCGCAG 
TDPGGTLCQCGR PRTAHPAV 
190 -210 230 

TGGCCATGGAGGATGCCTTCGGGGCAGCCGTGGTGACCGTGTGGGACAGCGATGCACACA 
A M E D A F. GAAVVTVWDSDAH T 
250 270 290 

CCACGGAGAAGCCCACCGATGCCTACGGAGAGCTGGACTTCACGGGGGCCGGCCGCAAGC 
TEKPTDAYGELD FTGAGRKH 
310 330 350 

AC^GCAATTTCCTCCGGCTCTCTGACCGAACGGATCCAGCTGCAGTTTATAGTCTGGTCA 
S N FLRLS D R T D P A A V Y S L V • T 
370 390 410 

CACGCACATGGGGCTTCCGTGCCCCGAACCTGGTGGTGTCAGTGCTGGGGGGATCGGGGG 
RTWGFRAPNLVVSVLGGSGG 
430 450 470 

GCCCCGTCCTCCAGACCTGGCTGCAGGACCTGCTGCGTCGTGGGCTGGTGCGGGCTGCCC 
PVI*QTWLQ DLLRRGLVRAAQ 
490 510 530 

AGAGCACAGGAGCCTGGATTGTCACTGGGGGTCTGCACACGGGCATCGGCCGGCATGTTG 
STGAWIVTGGLHTGIGRHVG 
550 570 590 

GTGTGGCTGTACGGGACCATCAGATGGCCAGCACTGGGGGCACCAAGGTGGTGGCCATGG 
VAVRDHQMASTGGTKVVAMG 
610 630 650 

GTGTGGCCCCCTGGGGTGTGGTCCGGAATAGAGACACCCTCATCAACCCCAAGGGCTCGT 
VAPWGVVRNRDTLINPKG. SF 
670 690 710 

TCCCTGCGAGGTACCGGTGGCGCGGTGACCCGGAGGACGGGGTCCAGTTTCCCCTGGACT 
PA RYRWRG DPEDGVQFPLDY 
730 750 770 

ACAACTACTCGGCCTTCTTCCTGGTGGACGACGGCACACACGGCTGCCTGGGGGGCGAGA 
NY SAFFLVD DGTHGCLGGEN 
790 810 830 

ACCGCTTCCGCTTGCGCCTGGAGTCCTACATCTCACAGCAGAAGACGGGCGTGGGAGGGA 
RFRLRLE S Y IS QQKTGVGGT 
850 870 890 

CTGGAATTGACATCCCTGTCCTGCTCCTCCTGATTGATGGTGATGAGAAGATGTTGACGC 
GI DIPVLLLLI DGDEKMLTR 
910 930 950 

GAATAGAGAACGCCACCCAGGCTCAGCTCCCATGTCTCCTCGTGGCTGGCTCAGGGGGAG 
IENATQAQLPCLLVAGSGGA 
970 990 1010 

CTGCGGACTGCCTGGCGGAGACCCTGGAAGACACTCTGGCCCCAGGGAGTGGGGGAGCCA 
ADCLAETLEDTLAPGSGGAR 
1030 1050 1070 

GGCAAGGCGAAGCCCGAGATCGAATCAGGCGTTTCTTTCCCAAAGGGGACCTTGAGGTCC 
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Fig. 9 / continu; n 1 

QGEARDRIRRFFPKGDLEVL 
1090 1110 1130 

TGCAGGCCCAGGTGGAGAGGATTATGACCCGGAAGGAGCTCCTGACAGTCTATTCTTCTG 
Q A Q V E R I MTRKELIiTVY S SE 
1150 1170 1190 

AGGATGGGTCTGAGGAATTCGAGACCATAGTTTTGAAGGCCCTTGTGAAGGCCTGTGGGA 
DG S EE FETIVLKALVKAC GS 
1210 1230 1250 

GCTCGGAGGCCTCAGCCTACCTGGATGAGCTGCGTrrGGCTGTGGCTTGGAACCGCGTGG 
SEASAYLDELRLAVAWNRVD 
1270 1290 1310 

ACATTGCCC AG AGTG AACT CT TTCGGGGGGAC ATCCAAT GGCGGTC CTTC CAT CT CGAAG 
IAQSELFRGDIQWRSFHLEA 
1330 1350 1370 

CTTCCCTCATGGACGCCOTGCTGAATGACCGGCCTGAGTTCGTGCGCT 

SLMDALLNDRPEFVRLLISH 
1390 1410 r- 1430 

ACGGCCT CAGCCTGG G CCACTTCCTGACCCCGATGCGCCTGGCCCAACT CT AC AGCGCGG 
GLSLGHFLTPMRLAQLYS AA 
1450 1470 1490 

CGCCCTCCAACTCGCTCATCCGCMCCTTTTGGACCAGGCGTCCCACAGCGCAGGCACCA 
PSN SLIRNL1»DQASHSAGTK 
1510 1530 1550 

AAGCCCCAGCCCTAAAAGGGGGAGCTGCGGAGCTCCGGCCCCCTGACGTGGGGCATGTGC 
APALKGGAAELRP P DVGHV L 
1570 1590 1610 

TGAGGATGCTGCTGGGGAAGATGTGCGCGCCGAGGTACCCCTCCGGGGGCGCCTGGGACC 
RMLLGKMCAPRYP SGGAWDP 
1630 1650 1670 

CTCACCCAGGCCAGGGCTTCGGGGAGAGCTTGTArCTGCTCTCGGAC^^GGCCACCTCGC 
HPGQGFGESMYLILSDKATSP 
1690 1710 1730 

CGCTCTCGCTGGATGC!PGGCCTCGGGC71GGCCCCCTGGAGCGACCTGCTTCTTTGGGCAC 
LSLDAGLGQAPWSDL1/LWAL 
1750 1770 1790 

TGTTGCTGAAnAGGGCACAGATGGCCATGTACTTCTTGGGAGATGGGTTCCAATGCAGTTT 
LLNRAQMAMYFWEMGSNAVS 
1810 1830 1850 

CCTCAGCT.CTTGGGGCCTGTTTGCTGCTCCGGGTGATGGCACGCCTGGAGCCTGACGCTG 
SALGACLLLRVMARLEPDAE 
1870 1890 1910 

AGGAGGCAGCACGGAGGAAAGACCTGGCGTTCAAGTTTGAGGGGATGGGCGTTGACCTCT 
EAARR KDLAFKFEGMGV DLF 
1930 1950 1970 

TTGGCGAGTGCTATCGCAGCAGTGAGGTGAGGGCTGCCCGCCTCCTCCTCCGTCGCTGCC 
GECYRSSEVRAARLLLRRCP 
1990 2010 2030 

CGCTCTGGGGGGATGCCACTTGCCTCCAGCTGGCCATGCAAGCTGACGCCCGTGCCTTCT 
LW G D A T CLQLAM QADARAFF 
2050 2070 2090 

TTGCCCAGGATGGGGTACAG TCTCTGCTGACACAG AAGTGGTGGGGAG ATAT GG CCAG CA 
AQDGVQSLLTQKWWGDMAST 
2110 2130 2150 

CTACACCCATCTGGGCCCTGGTTCTCGCCTTCTTrTGCCCTCCACTCATCTACACCCGCC 
TPIWALVLAFFCPPLIYTRL 
2170 2190 2210 

T CATC ACCTT C AGG AAAT C AG AAGAGG AG C CC ACACGGG AGG AGCT AG AGTTTG ACATGG 
ITFRKSEEEPTREELE F D M D 
2230 2250 2270 

at agtgt catt aatg gg g aagggcctgt cggg acggcggacccagccgag aag acgccgc 
sving egpvgtadpaektpl 

2290 2310 2330 
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Fig. 9 / continue^ in 2 

TGGGG GT CCCGCGCC AGT CGGGCCGT CCGGGTTG CTG CGGGGGCCGCTGCGGGGGGCGCC 
GVPRQSGRPGCCGGRCGGRR 
2350 2370 2390 

GGTGCCTACGCCGCTGGTTXXACTTCTX3GGGCGTGCCGGTGACXATCTO 

CLRRWFH FWGVPVTIFMGNV 
2410 2430 2450 

TGGTCAGCTACCTGCTGTTCCTGCTGCTTTTCTCGCGGGTGCTGCTCGTGGATTTCCAG^ 
VSYLLFLLLFS RVLLVDFQP 
2470 2490 2510 

CGGCGCCGCCCGGCTCCCTGGAGCTGCTGCTCTATTTCTGGGCTTTCACGCT 

A P P G SLELLLYFWAFTIiljCE 
2530 2550 2570 

AGGAACTGCGCCAGGGCCTGAGCGGAGGCGGGGGCAGCCTCGCCAGCGGGGGCCCCGGGC 
ELRQGLSGGGGSLASGGPG P 
2590 2610 2630 

CTGGCCATGCCTCACTGAGCCAGCGCCTGCGCXrrCTACCTCGCCGACAGCTGGAACCAGT 
G H A 5 LSQRLRI/YLADSWNQC 
2650 2670 2690 

GCGACCTAGTGGCTCTCACCTGCTTCCTCCTGGGCGTGGGCTGCCGGCTGACXCCGGGT^ 
DLVALTCFLLGVGCRI*TPGL 
2710 2730 2750 

TGTACCACCTGGGCCGCACTGTCCrrCTGCATCGACTTCATGGTTTTCACQ^GCGGCTGC 
YHLGRTVLCI DFMVFTVRLL 
2770 2790 2810 

TT C ACATCTTC ACGGT CAACAAAC AG CTGGGGCGCAAG AT CGT C ATCGTG AGC AAGATG A 
HIFTVNKQLGPKIVIVSKMM 
2830 2850 2870 

TGAAGGACGTGTTCTTCXTCCTCTTCTTCCTCGGCGTGTGGCTGGTAGCCTATGGCGTGG 
KDVFFFliF FLGVWLVAYGVA 
2890 2910 2930 

eCACGGAGGGGCTCCTGAGGCCACGGGACAGTGACTTCCCAAGTATCCTGCGCCGCGTCT 
TEGLLRPRDSDFPSILRRVF 
2950 2970 2990 

TCTACCGTCCCTACCTGCAGATCTTCGGGCAGATTCCCCAGGAGGACATGGACGTGGCCC 
YRPYLQI FGQI PQEDMDVAL 
3010 3030 3050 

TCATGGAGCACAGCAACTGCTCGTCGGAGCCCGGCTTCTGGGCACACCCTCCTGGGGCCC 
ME'RSNCSSEPGFWAHPPGAQ 
307 0 3090 3110 

AGGCGGGCACCTGCGTCTCCraGTATGCCAACTGGCTGGTGGTGCrrGCTCCTCGT.CATCT 
AGTCVSQYANWLVVLLLVIF 
3130 3150 3170 

TCCTGCTCGTGGCCAACATCCTGCTGGTCAACTTGCTCATTGCCATGTTCAGTTACACAT 
LLVANILLVNLLIAMFSYTF 
3190 3210 3230 

TCGGCAAAGTACAGGGCAACAGCGATCTCTACTGGAAGGCX5CAGCGTTACCGCCTCATCC 
GKV QGN S d'lYWKAQRYR L X R 
3250 3270 3290 

GGGAAT TCCACTCTCGG CCCGCGCTGGCCCCGCCC!TTTATCGT C ATCTCC C ACT T GCGCC 
EFHSRPALAPPFIVISHLRL 
3310 3330 3350 

TCCTGCTCAGGCAATTGTGCAGGCGACCCCGGAGCCCCCAGCCGTCCTCCCCGGCCCTCG 
LLRQLCRRPRSPQPSSPALE 
3370 3390 3410 

AGCATTTCCGGGTTTACCTTTCTAAGGAAGCXX3AGCGGAAGCTGCTAACGTGGGAATCGG 
HFRVYLSKEAERKLLTWE S V 
3430 3450 3470 

TGCATAAGGAGAACTTTCTGCTGGCACGCGCTAGGGACA^GCGGGAGAGCGACTCCGAGC 
HKENFLLARARDKRESDSER 
3490 3510 3530 

gtctgaagcgcacgtcccagaaggtggacttggcactgaaacagctgggacacatccgcg 
lkrts^kvdlalkqlghire 
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Fig. 9 / continua i 3 

3550 3570 3590 

AGTACGAACAGCGCCTGAAAGTGCTGGAGCGGGAGGTCCAGCAGTGTAGCCGCGTCCTGG 
YEQRLKVLEREVQQCSRVLG 

36iO 3630 3650 

GGTGGGTGGCCGAGGCCCTGAGCCGCTCTGCCTTGCTGCCCCCAGGTGGGCCGCCACCCC 
WVAEALSRSALLPPGGPPPP 

3670 3690 3710 

CTGACCTGCCTGGGTCCAAAGACTGAGCCCTGCTGGCGGACTTCAAGGAGAAGCCCCCAC 
DLPGSKD* 

3730 3750 3770 

AGGGGATTTTGCXCCTAGAGTAAGGCTCATCTGGGCCTCGGCCCCCGCACCTGGTGGCCT 

3790 3810 3830 

TGTCCTTGAGGrGAGCCCCATGTCCATCTGGGCCACTGTCAGGACCACCTTTGGGAGTGT 

3850 3870 3890 

CATCCTTACAAACCACAGCATGCCCGGCTCCTCCCAGAACCAGTCCCAGCCTGGGAGGAT 

3910 3930 3950 

CAAGGCCTGGATCCCGGGCCGTTATCCATCTGGAGGCTGCAGGGTCCTTGGGGTAACAGG 

3970 3990 4010 

GACCACAGACCCCTC^CCACTCACAGATTCCTCACACTGGGGi^ 

4030 

GGAAAAAAAAAAAAAAAAAAAA 



MWPEKEQ S W I PKI FKKKT CTT FI VDST D PGGTLCQCGRPRTAH PAVAMEDAFGAAWT VWD S DAHTTEKPT DAYEIiDFTGAGRKH 
SN FLRLS DRT DPAAVYS LVTRT WG FRAPN IjWSVIjGG S GGPVLQTWLQDLLRRGL VRAAQS TGAW I VTGGLHTGIGRHVGVAVRDH 
QMASTGGTKWAMGVAPWG WRNRDTIiI N PKGSFPARYRWRGDPE DGVQFPLrDYNY S AFFLVDOGTHGCIiGGENRFRLRUESY I SQ 
QKTGVGGTG I D I P VLLL L I DG DEKMLTR I EN ATQAH VPCLLVAGSRGLGMPGGTIjE AHLAQDG DHKANQS TNQLLLPKDLSLQP VE 
SIDRKIXQSYSERIAVAWITOVDIAQSELFRGDIQWRSFHLEA 

LIRNLLDQASHSAGTKAPALKGGAAELRP POVGHVLBIMLLGKMCAPRY PSGGAWDPHPGQGFGE SMYLLS DKAT5 PLSLDAGLGQA 
PWSDLLLWALLLNRAQMAMYFWEMGSNAVSSAJ^CLLL^^ 

RRC PLWG DAT CLQLAMQA DARA FFAQ DGVQS I*1/TQKWWG DMASTT P I WALVLA FFC PPLI YTRLIT FRKSEEE PTREELE FDMDS V 
INGE G PVGTADPAEKT PLGV PRQSGRPGCCGGRCGGRRCLRRW FH FWGVPVT I FMGNW S YLLFIX LFS RVLLVDFQPAP PGS I*EI* 
LLYFWAFTLLCEELRQGLSGGGGSLASGGPGPGHASLSQRLRLYIADSVmQCDLVALTCFLLGVGCPJjTPGLYHLGRTXTLCIDFIW 
FTVRLLHI FTVNKQLGPK1VI VSKMMKDVFFFLFFLGWLVAYGVATEGLLRPRDSDFPS ILRRVFYRPYLQIFGQIPQEDMDVAL 
MEH S1S3 C S SEPGFWAHPEGAQAGTCVSQYANWLVVLLLVI FLLVAK ILLVNIiLI AMFSYT FGKVQGNSDL YWKAQRYRLIREFHSRP 
ALAPP FI VI SHLRIJjjRQLCRRPRS PQPS S PALEHFRVYLSKEAERKLLTWES VHKENFLLARARDKRES DSERLKRTSQKVDLAL 
KQXGHIREYEQRLKVLEREVQQCSRVLGWVAEALSRSALLPPGGPPPPDLPGSKD 

10 30 50 

ATCCAATGGCGGTCCTTCCATCTCGAAGCTTCCCTCATGGACGCCCTGCTGAATGACCGG 

70 90 110 

CCTGAGTTCGTGCGCTTGCTCATTTCCCACGGCCTCAGCpTGGGCCACrTCCTGACCCCG 

130 150 170 

ATGCGCCTGGCCCAACTCTACAGCGCGGCGCCCTCCAACTCGCTCATCCGCAACCTTTTG 

190 210 230 

GACCAGGCGTCCCACAGCGCAGGCACCAAAGCCCCAGCCCTAAAAGGGGGAGCTGCGGAG 

250 270 290 

CTCCGGCCCCCTGACGTGGGGCATGTGCTGAGGATGCTGCTGGGGAAGATGTGCGCGCCG 

310 330 350 

AGATGTATCTGCTCrCGGACAAGGCCACCTCGCCGCTCTCGCTGGATGCTGGCCTCGGGC 
MYLLSDKATSPLSLDAGLGQ 
370 390 410 

AGGCCCCCrGGAGCGACCTGCTTCTTTGGGCACTGTTGCTGAACAGGGCACAGATGGCCA 
APWSDLLLWALLLNRAQMAM 
430 450 470 

TGTACTTCrGGGAGATGGGTTCCAATGCAGTTTCCTCAGCTCTTGGGGCCTGTTTGCTGC 
Y FWEMGSNAVSS ALGACLLL 
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490 510 530 

T CCGGGTGATGG C ACGCCTGG AG CCTG ACGCTG AGG AGGCAGCACGGAG G AAAGAC C TGG 
RVMARLEPDAEEAARRKDLA 
550 570 590 

CGTTCAAGTTTGAGGGGATGGGCGTTGACCTCTTTGGCGAGTGCTATCGCAGCAGTGAGG 
FKFEGMGV DLFGECYR SS EV 
610 630 650 

rGAGGGCTGCCCGCCTCCTCCrcCGTCGCTGCCCGCTCTGGGGGGATGCCACTTGCCTCC 
RAARLLLRRCPLWGDATCLQ 
670 690 710 

AGCTGGCCATGCAAGCTGACGCCCGTGCCTTCTTTGCCCAGGATGGGGTACAGTCTCTGC 
LAMQADARAFFAQDGVQSLL 
730 750 770 

TGACACAGAAGTGGT GGGGAGAT ATGG CC AGCACTACACCCATCTGGGCCCTGG TTCTCG 
TQKWWGDMASTT PIWALVI*A 
790 810 830 

CCTTCTTTTGCCCTCCACTCATCTACACCCGCCTCATCACCTTC^GGAAATCAGAAGAGG 
FFC PPLIYTRLI TFRKSEEE 
850 870 890 

AGCCCACACGGGAGGAGCTAGAGTTTGACATGGATAGTGTCATTAATGGGGAAGGGCCTG 
PTREELEFDMDSVINGEG PV 
910 930 950 

TCGGGACGGCGGACCCAGCCGAGAAGAC^CCGCTGGGGGTCCCGCGCCAGTCGGGCCGTC 
GTADPAEKTPLGVPRQSGRP 
970 990 1010 

CGGGTTGCTGCGGGGGCCGCTGCGGGGGGCGCCGGTGCCTACGCCGCTGGrTCCACTTCT 
GCCGGRCGGRRCLRRWFHFW 
103O 1050 1070 

GGGGCGTGCCGGTGACCATCTTCATGGGCAACGTGGTCAGCTACCTGCTGTTCCTGCTGC 
GVPVT1FMGNVVSYLLFLLL 
1090 1110 1130 

TTTTCTCGCGGGTGCTGCTCGTGGATTTCCAGCCGGCGCCGCCCGGCTCCCTGGAGCTGC 
FSRVLLVD FQPAPPGSLELL 
1150 1170 1190 

TGCTCTATTTCTGGGCTTTCACGCTGCTGTGCGAGGAACTGCGCCAGGGCCTGAGCGGAG 
LYFWAFTLLCEE LRQGLSGG 
1210 1230 1250 

GCGGGGGCAGCCTCGCCAGCGGGGGCCCCGGGCCTGGCCATGCCTCACTGAGCCAGCGCC 
GG S LASGGP GPGHASL S Q R L 
1270 1290 1310 

TGCGCCTCTACCTCGCCGACAGCTGGAACCAGTGCGACCTAGTGGCTCTCACCTGCTTCC 
RLYLADSWN QCDLVALTCFL 
1330 1350 1370 

TCCTGGGCGTGGGCTGCCGGCTGACCCCGGGTTTGTACCACCTGGGCCGCACTGTCCTCT 
LGVGCRLT PGLYHLGRTVLC 
1390 1410 1430 

GCATCGACTTCATGGTXTTCACGGTGCGGCTGCTTCACATCTTCACGGTCAACAAACAGC 
IDFMVFTVRLLHIFTVNKQL 
1450 1470 1490 

TGGGGCCCAAGATCGTCATCGTGAGCAAGATGATGAAGGACGTGTTCTTCTTCCTCTTCT 
GPKIVIVSKMMKDVFFFLFF 
1510 1530 1550 

TCCTCGGCGTGTGGCTGGTAGCCTATGGCGTGGCCACGGAGGGGCTCCTGAGGCCACGGG 
LGVWLVAYGVATEGLLR prd 
1570 1590 1610 . 

ACAGTG ACTT CC C AAGTATCCTGCG CCG CG TCTT CTACCGTCCCT ACCTGCAG AT CT TCG 
SDFPSILRRVFYRPYLQ IFG 
1630 1650 1670 

GGCAGATTCCCCAGGAGGACATGGACGTGGCCCTCATGGAGCACAGCAACTGCTCGTCGG 
QIPQEDMDVALMEHSNCSSE 
1690 1710 1730 

AGCCCGGCTTCTGGGCACACCCTCCTGGGGCCCAGGCGGGCACCTGCGTCTCCCAGTATG 
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PGFWAHP PGAQAGTCVSQYA 
1750 1770 1790 

CCAACTGGCTGGTGGTGCTGCTCCTCGTCATCTTCCTGCTCGTGGCCAACATCCTGCTGG 
NWLVV LLLVI FLLVAN ILLV 
1810 1830 1850 

TCAACTTGCTCATTGCCATGTTCAGTTACACATTCGGCAAAGTACAGGGCAACAGCGATC 
NLLIAMFSYTFGKVQGNSDL 
1870 1890 1910 

TCTACTGGAAGGCGCAGCGTTACCGCCTCATCCGGGAATTCCACTCTCGGCCCGCGCTGG 
YWKAQRYRLI REFHSR P A L A 
1930 0.950 1970 

CCCCGCCCT TTATCGT CATCTCCCACTTGCGCCTCCTGCTCAGGCAATTGTGCAGGCGAC 
PPFIV I SHLR LLLRQLCR RP 
1990 2010 2030 

CCCGGAGCCCCCAGCCGTCCTCCCCGGCCCTCGAGCATTTCCGGGTTTACCTTTCTAAGG 
RSPQPSSPALE.HFRVYLSKE 
2050 2070 2090 

AAGCCGAGCGGAAGCTGCTAACGTGGGAATCGGTGCATAAGGAGAACTTTCTGCTGGCAC 
AERKLLTWE SVHKENFLLA. R 
2110 2130 2150 

GCGCTAGGGACAAGCGGGAGAGCGACTCCGAGCGTCTGAAGCGCACGTCCCAGAAGGTGG 
ARDKRESDSERLKRTS QKVD 
2170 2190 2210 

ACTTGGCACTGAAACAGCTGGGACACATCCGCGAGTACGAACAGCGCCTGAAAGTGCTGG 
LALKQLGHIREYEQRLKVLE 
2230 2250 2270 

AGCGGGAGGTCCAGCAGTGTAGCCGCGTCCTGGGGTGGGTGGCCGAGGCCCTGAGCCGCT 
REVQQCSRVLGWVAEALSRS 
2290 2310 2330 

CTGCCTTGCTGCCCCCAGGTGGGCCGCCACCCCCTGACCTGCCTGGGTCCAAAGACTGAG 
ALLP PGGPP PPDLPGSKD* 
2350 2370 2390 

CCCTGCTGGCGGACTTCAAGGAGAAGCCCCCACAGGGGATTTTGCTCCTAGAGTAAGGCT 

2410 2430 2450 

CATCTGGGCCTCGGCCCCCGCACCTGGTGGCCTTGTCCTTGAGGTGAGCCCCATGTCCAT 

2470 2490 2510 

CTGGGCCACTGTCAGGACCACCTTTGGGAGTGTCATCCTTACAAACCACAGCATGCCCGG 

2530 2550 2570 

CTCCTCCCAGAACCAGTCCCAGCCTGGGAGGATCAAGGCCTGGATCCCGGGCCGTTATCC 

2590 2610 2630 

ATCTGGAGGCTGCAGGGTCCTTGGGGTAACAGGGACCACAGACCCCTCACCACTCACAGA 

2650 2670 2690 

TTCCTCACACTGGGGAAATAAAGCCATTTCAGAGGAAAT^AAAAAAAAAAAAAAA 



myllsdkatsplsldaglgqapwsdlllwalllnraqmamye^ 
gvdlfgecyrssevraarlllrrcplwgdatclqlamqadaraffaqdgvqsl^ 

tbtucseeeptreelefdmdsvingegpvgtadpaektplgvprqsgrpgccggrcggrrclrrwfhfwgvpvtifmgnvvsyllfl 

llfsrvllvdfqpappgslelllyfwaftllceelrqglsggggslasggpgpghaslsqrlrlyladswnqcdlvaltcfllgvg 

crltpglyhlgrtvlcidfiwftvrllhiftvnkqi^pk^ 

eyrpymipgqipqedmdvai^hsncssepgfwahpfgaqagtct 

nsdlywkaqryrlirefhsrpai^ppfivishlrlllrqlcrrprspqp 

dkresdserlkrtsqkvdialkqlghireyeqrlkvlerevqqcsrvlgwvaealsrsallppggppppdlpgskd 
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10 30 50 

ATTA3^GTTTKTAAAACAGTGGCTGGATGGTTGGAGGATGCAGGTGGACAGAKGACGTGG 

MVGGCRWTEDVE 
70 90 110 

AGCCTGCAGAAGTAAAGGAAAAGATGTCCTrTCGGGCAGCCAGGCTCAGCATGAGGAACA 
PAEVKEKMSFRAARLSMRNR 
130 150 170 

GAAGGAATGACACTCTGGACAGCACCCGGACCCTGTACTCCAGCGCGTCTCGGAGCACAG 
RNDT. LDSTRTLYSSASRSTD 
190 210 230 

ACTTGTCTTACAGTGAAAGCGCCAGCTTCTACGCTGCCTTCAGGACACAGACGTGCCCAA 
LSYSESASFYAAFRTQTCPI 
250 270 290 

TCATGGCTTCTTGGGACTTGGTGAATTTTATTCAAGCAAATTTTAAGAAACGAGAATGTG 
MAS WDLVNFI QANFKKRECV 
310 330 350 

TCTTCTTTACCAAAGATTCCAAGGCCACGGAGAATGTGTGCAAGTGTGGCTATGCCCAGA 
FFTKDSKATENVCKCGYAQS 
370 390 410 

GCCAGCACATGGAAGGCACCCAGATCAACCAAAGTGAGAAATGGAACTACAAGAAACACA 
QHMEGTQINQSEKWNYKKHT 
430 450 470 

CCAAGGAATTTCCTACCGACGCCTTTGGGGATATTCAGTTTGAGACACTGGGGAAGAAAG 
KE FPTDAFGDIQFETLGKK G 
490 510 530 

GGAAGTATATACGTCTGTCCTGCGACACGGACGCGGAAATCCTTTACGAGCTGCTGACCC 
KYI RLSCDTDAEILYELLTQ 
550 570 590 

AGCACTGGCACCTGAAAACACCCAACCTGGTCATTTCTGTGACCGGGGGCGCCAAGAACT 
HWH LKTPNLVI SVTGGAKN F 
610 630 650 

TCGCCCTGAAGCCGCGCATGCGCAAGATCTTCAGCCGGCTCATCTACATCGCGCAGTCCA 
ALK PRMRKIFSRLIYIAQSK 
670 690 710 

AAGGTGCTTGGATTCTCACGGGAGGCACCCATTATGGCCTGATGAAGTACATCGGGGAGG 
GAW ILTGGTHYGLMKYIGEV 
' 730 750 770 

TGGTGAGAGATAACACCATCAGCAGGAGTTCAGAGGAGAATATTGTGGCCATTGGCATAG 
VRDNTISRSSEENIVAIGIA 
790 810 830 

CAGCTTGGGGCATGGTCTCCAACCGGGACACCCTCATCAGGAATTGCGATGCTGAGGGCT 
AWGMVSNRDTLIRNCDAEGY 
850 870 890 

ATTTTTTAGCCCAGTACCTTATGGATGACTTCACAAGAGATCCACTGTATATCCTGGACA 
FLAQYLMDDFTRDPLYILDN 
910 930 950 

ACAACCACACACATTTGCTGCTCGTGGACAATGGCTGTCATGGACATCCCACTGTCGAAG 
NHTHLLLVDNGCHGHPTVEA 
970 990 1010 

CAAAGCTCCGGAATCAGCTAGAGAAGTATATCTCTGAGCGCACTATTCAAGATTCCAACT 
KLRNQLEKYISERTIQDSNY 
1030 1050 1070 

ATGGTGGCAAGATCCCCATTGTGTGTTTTGCCCAAGGAGGTGGAAAAGAGACTTTGAAAG 
GGK1PIV.CFAQGGGKETLKA 
i090 1110 1130 

CCATCAATACCTCCATCAAAAATAAAATTCCTTGTGTGGTGGTGGAAGGCTCGGGCCAGA 
INT5IKNKIPCVVVEGSGQI 
1150 1170 1190 

TCGCTGATGTGATCGCTAGCCTGGTGGAGGTGGAGGATGCCCTGACATCTTCTGCCGTCA 
ADVIASLVEVEDALTSSAVK 
1210 1230 1250 
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AGGAGAAGCTGGTGCGCTTTTTACCCCGCACGGTGTCCCGGCTGCCTGAGGAGGAGACTG 
EKLVRFLPRTVSRLPEEETE 
1270 1290 1310 

AGAGTTGGATCAAATGGCTCAAAGAAATTCTCGAATGrTCTCACCTATTAAOi.GTTATTA 
SHZKWLKEILBCSHLLTVZX 
1330 1350 1370 

AAATGGAAGAAGCTGGGGATGAAATTGTGAG CAAT GCCAT CTCCT ACGCTCT AT ACAAAG 
MEEAGDEIVSNAISYALYKA 
1390 1410 1430 

CCTTC AGC ACCAGTGAGC AAG ACAAGGAT AACTGG AATGGGC AGCTGAAG CT"T CTGCTGG 
FSTSEQDKDNWNGQLKLLLE 
1450 1470 1490 

AGTGGAACCAGCTGGACTTAGCCAATGATGAGATTTTCACCAATGACCGCCGATGGGAGA 
WNQLDLANDEI FTNDRRWEK 
1510 1530 1550 

AGAGCAAACCGAGGCTCAGAGACACAATAATCCAGGTCACATGGCTGGAAAATGGTAGAA 
SKPRLRDTI I QVTWLENGRI 
1570 1590 1610 

TCAAGGTTGAGAGCAAAGATGTGACTGACGGCAAAGCCTCTTCTCATATGCTGGTGGTTC 
KVE SKDVTDGKAS SHMLVVL 
1630 1650 1670 

TCAAGTCTGCTGACCTTCAAGAAGTCATGTTTACX3GCTCTCATAAAGGACAGACCCAAGT 
KSADLQEVM FT.ALI KDRPKF 
1690 1710 1730 

TTGTCCGCCTCTTTCTGGAGAATGGCTTGAACCTACGGAAGT'rTCTCACCCATGATGTCC 
VRLFLENGLNLRKFLTHDVL 
1750 1770 1790 

TCACTGAACTCTT CT CCAACCACTTCAGCACGCT TGrGTACCGGAAT CTGCAGAT CGCCA 
TELFSNHFSTLVYRNLQIAK 
1810 1830 1850 

AG AATTCCTATAATGATG CC CTCCTCACGTTT GTCTGGAAACTGGTTGCGAACTT CCGAA 
NSYNDALLTFVWKLVANFRR 
1870 1890 1910 

GAGGCTTCCGGAAGGAAGACAGAAATGGCCGGGACGAGATGGACATAGAACTCCACGACG 
GFRKEDRNGRDEMDIELHDV 
1930 1950 1970 

TGTCTCCTATTACTCGGCACCCCCTGCAAGCICTCTTC^TCTGGGCCATTCTTCAGAATA 
SP ITRHPLQALFIWAI LQNK 
1990 2010 2030 

AGAAGGAACTCTCCAAAGTCATTTGGGAGCAGACCAGGGGCTGCACTCTGGCAGC.CCTGG 
KELSKVIWEQTRGCTLAA1.G 
2050 2070 2090 

GAGCCAGCAAGCTTCTGAAGACTCTGGCCAAAGTGAAGAACGACATCAATGCTGCTGGGG 
ASKLLKTLAKVKND I NAAGE 
2110 2130 2150 

AGTCCGAGGAGCTGGCTAATGAGTACGAGACCCGGGCTGTTGGTGAGTCCACAGTGTGGA 
SEELANEYETRAV GESTVWN 
2170 2190 2210 

ATGCTGTGGTGGGCGCGGATCTGCCATGTGGCACAGACATTGCCAGCGGCACTCATAGAC 
AVVGADLPCGTD IASGTHR P 
2230 2250 2270 

C AGATGGTGGAGAGCTGFTCACTGAG? GT TACAG C AGCG ATG AAGACTTGGCAGAACAGC 
DGGE LFTECYS SDEDLAEQL 
2290 2310 2330 

TGCTGGTCTATTCCTGTGAAGCTTGGGGTGGAAGCAACTGTCTGGAGCTGGCGGTGGAGG 
LVYSCEAWGGSNCLELAVEA 
2350 2370 2390 

CCACAGACCAGCArTTCATCGCCCAGCCTGGGGTCCAGAATTTTCrTTCTAAGCAATGGT 
TDQHFIAQPGVQNFLSKQWY 
2410 2430 2450 

ATGGAGAGATTTCCCGAGACACCAAGAACTGGAAGATTATCCTGTGTCTGTTTATTATAC 
GEISRDTKNWKIILCLFIIP 
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2470 2490 2510 

CCTTGGTGGG CTGTGGCTTT GTATCATTT AG GAAG AAACCTGTCG ACAAGCAC AAGAAGC 
LVGCGFVSFRKKPVDKHKKL 
2530 2550 2570 

TGCTTTGGT ACT ATGTGGCGTT CTTC ACCTCCCCCTTCGTGGTCTTCTCCTGGAAT GTGG 
LWYYVAFFTS PFVVFSWNVV 
2590 2610 2630 

TCTTCTACATCGCCTTCCTCXrrGCTGTTTGCCT^ 

FYIAFLXLFAYVLLMDFHSV 
2650 2670 2690 

TGCCACACCCCCCCGAGCTGGTCCTGTACTCGCTGGTCTTTGTCCTCTTCTGTGATGAAG 
PHPPELVLYS LVFVLFC DE V 
2710 2730 2750 

TGAGACAGGGCCGGCCGGCTGCTCCC^GTGCGGGGCOCGCCAAGCCCACGCCCACCCGGA 
RQGRPAAPSAGPAKPTPTRN 
2770 2790 2810 

ACTCCATCTGGCCCGCAAGCrCCACACGCAGCCCCGGTTCCCGCTCACGCCACTCCTTCC 
SIWPASSTRS PGSRSRHS F H 
2830 - 2850 2670 

ACACTTCCCTGCAAGCTGAGGGTGCCAGCTCTGGCCTTGGCCAGCCCAGAAAGGGGTGGA 
TSLQAEGAS S GLGQP .RKGW T 
2890 2910 2930 

CATTTAAAAATCTGGAAATGGTTGAT ATTT CCAAGCTGCTGATGTCCCT CTCTGTCCCT T 
FKNI-EMVDISKLL.MSI.SVPF 
2950 2970 2990 

TCTGTACGCAGTGGTACGTAAATGGGGTGAATTATTTTACTGACCTGTGGAATGTGATGG 
C1QWYVNGVNYFTDLWNVMD 
3010 3030 3050 

ACACGCTGGGGCTTTTTTACTTCATAGCAGGAATTGTATTTCGGC^AGGGATCCTTAGGC 
TLGLFYFIAGIVFRQGILRQ 
3070 3090 3110 

AGAATGAGCAGCGCTGGAGGraGATATTCCGrTCGGTCATCTACGAGCCCTACCTGGCCA 
NEQRWRWI FRSVIYEPYLAM 
3130 3150 3170 

TGTTCGGCCAGGTGCCCAGTGACGTGGAT GGTACC ACGT ATGACTTTGCCCACT GCAC CT 
FGQVPSDVDGTTYDFAHCTF 
3190 3210 3230 

TCACrGGGAATGAGTCCAAGCCACTGTGTGTGGAGCTGGATGAGCACAACCTGCCCCGGT 
TGNESKPLCVELDEHNLPRF 
3250 3270 3290 

TCCCCG AGT GGATCACCATCCCCCTGGT GT GCATCTACATGTT ATCC ACC AAC AT CCTGC 
PEW ITI PLVC IYMLSTNI LL 
3310 3330 3350 

TGGTCAACCTGCTGGTCGCCATGTTTGGCTACACX3GTGGGCACCGTCCAGGAGAACAATG 
VNLLVAMFG YTVGTVQEN N D 
3370 3390 3410 

ACCAGGTCTGGAAGTTCCAGAGGTACTTCCTGGTGCAGGAGTACTGCAGCCGCCTCAATA 
QVWKFQRYFLVQEYCSRLNI 
3430 3450 3470 

TCCCC!TTCCCCTT CAT CGT CT T CGCTTACTTCTACATGGTGGT GAAGAAGT GCTTCAAGT 
PFPFIVFAYFYMVVKKCFKC 
3490 3510 3530 

GTTGCTGCAAGGAGA?^AAACATGGAGTCTTCTGTCTGCTGTGAGTGGTTTATCCATGTGT 
CCKEKNMES SVCCEWFIH VY 
3550 3570 3590 

ACTTGGG AT CAG AAGCAG CG ATTAATTT CAGGGAAGGATGCCT GC AT CC AGTG AT TGG AA 
LGSEAAINFREGCLHPVI GS 
3610 3630 3650 

GCTGGACCCCAGGCTGGCTGGTCTGGACATCCACACGCATTCTCACATGCAGTGCCGGCT 
WTPGWLVV1T ST RILTCSAGW 
3670 3690 3710 

GGCCAGCAGCAGGGAGTCTCAGTGTCACCACACATAGCAGCTGGGTTCCTGCAAAAAGCA 
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PAAGSLSVTTHSSWVPAKSS 
3730 3750 3770 

GCAAGTCACAGGCCCACCCAGACAGAACGGGTAGAGAATGTGACTCTGCTTCTGGGTGGG 
KSQAHPDRTGRECDSASGWE 
3790 3810 3830 

AAGGACAGCCTGCCCGGTGGGTGGAAGAATCCGTGGCCCTGTTTGGCCATCGTGGCCCTG 
GQPARWVEESVALFGHRGPV 
3850 3870 3890 

TTTGGCCACCTACCACTCTAGGCATCACTGAGCTGAATGCGCCGGTCCTCTGA 
WPPTTLG ITELNAPVL* 



MVGGCRWTEDVEPAEVKEKMSFRAARLSMRNRRNDTLDSTRTLYSSASRSTDLSYSESASFYAAFRTQTCPIMASWDLV^ 
KKRECVFFTKDSKATEWCKCGYAQSQH^GTQINQSEKWNYKKHTKEFPTDAFGDIQFETLGKKGKY 

HWHLKTPNLVIS VTGGAKN FALKPRMRKI FSRLI YI AQSKGAWILTGGTHYGLMKYIGE WRDNT I SRS SEEK IVAI GI AAWGMVS 
NRDTLIRNCDAEGYFIAQYLMDDFTRDPLYILDNNHTHIiLVDNGCHGHPT\^AKLRNQLEKYISERTIQDSNYGGKIPIVCFAQG 
GGKETLKAINTSIKNKIPCVVVEGSGQIADVIASLVEVEDALTSSAVKEKL\^ 

IKMEEAGDEIVSNAISYALYKAFSTSEQDKDNWNGQIJCLLLEWNQLDLANDEIFTNDRRWEKSKPRLR 

KDVTDGKAS SHMLWLKSADLQEVMFTALIKDRPKFVRLFLENGLNLRKE*LTHDVLTELFSNH FSTLVYRNLQI AKN S YN DALLTF 

VWKLVANFRRGFRKEDRNGRDEMDIELHDVSPITRHPLQALFIWAILQNKKELSK^^ 

AAGESEELANEYETRAVGESTVWNAWGADLPCGTDIASGTHRPDGGELFTECYSSDEDIJ^^ 

QH FIAQPG VQNFLSKQWYGE ISRDTKNWKI ILCLFI I PLVGCGFVSFRKKPVDKHKKLLW YYVAFFTSPFWFSWNWFYIAFLLL 
FAYVLLMDFHSVPHPPELVLYSLVFVLFCDEVRQGRPAAPSAGPAKPTPTRNSIWPASSTRSPGSRSRHSFHTSLQAEGASSGLGQ 
PRKGWTFKNLEIWDISKLLMSLSVPFCTQWYVNGVNYFTDLWNVM^^ 

FGQVPSDVDGTTYDFAHCTFTGNESKPLCVELDEHNLPRFPEWITIPLVCIYMLSTNILLVNLLVAMFCTTVGTVQENNDQVWKFQ 
RYFLVQEYCSRLNIPFPFIVFAYFYMVVKKCFKCCCKEKNMESSVCCEWFIHVTLGSEAAINFREGCLHPVIGSWTPGWLVWTSTR 
ILTCSAGWPAAGSLSVTTHSSWVPAKSSKSQAHPDRTGRECDSASGWEGQPARWVEESVALFGHRGPVWPPTTLGITELNAPVL 



B . 



Q L 

2290 2310 2330 

TGCTGGTCTATTCCTGTGAAGCTTGGGGTGGAAGCAACTGTCTGGAGCTGGCGGTGGAGG 
LV YSCEAWGGSNCLELAVEA 
2350 2370 2390 

CCACAGACCAGCATTTCATCGCCCAGCCTGGGGTCCAGAATTTTCTTTCTAAGCAATGGT 
TDQH FIAQPGVQN FLSKQWY 
2410 2430 2450 

ATGGAGAGATTTCCCGAGACACCAAGAACTGGAAGATTATCCTGTGTCTGTTTATTATAC 
GEISRDTKNWKIILCLFIIP 
2470 2490 2510 

CCTTGGTGGGCTGTGGCTTTGTATCATTTAGGAAGAAACCTGTCGACAAGCACAAGAAGC 
LV'dCGFVSFRKKP VDK 
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a.) TrplOb cDNA and derived amino acid sequence 



10 30 50 

ATGAAATCCTTCCTTCCTGTCCACACCATCGTGCTTATCAGGGAGAATGTGTGCAAGTGT 
MKSFLPVHTIVLIRENVCKC 

70 90 110 

GGCTATGCCCAGAGC<^GC^C^TGGAAGGC^CCCAGATCAACCAAAGTGAGAAATGGAAC 
GYAQSQHMEGTQINQSEKWN 

130 150 170 

TACAAGAAACACACCAAGGAATTTCCTACCGACGCCTTTGGGGATATTCAGTTTGAGACA 
YKKH TKEF PTDAFGD I QFET 

190 210 230 

CTGGGGAAGAAAGGGAAGTATATACGTCTGTCCTGCGACACGGACGCGGAAATCCTTTAC 
LGKKGKYIRI1SCDTDAEI LY 

250 270 290 

GAGCTGCTGACCCAGCACTGGCACCTGAAAACACCCAACCTGGTCATTTCTGTGACCGGG 
ELLTQHWHIiKTPNLV I SVT G 

310 330 350 

GGCGCCAAGAACTTCGCCCTGAAGCCGCGCATGCGCAAGATCTTCAGCCGGCTCATCTAC 
GAKN FALKPRMRKl FS R I> I Y 

370 390 410 

ATCGCGCAGTCCAAAGGTGCTTGGATTCTCACGGGAGGCACCCATTATGGCCTGATGAAG 
IAQ S KGAW IIiTGGTHY GLMK 

430 450 470 

TACATCGGGGAGGTGGTGAGAGATAACACCATCAGCAGGAGTTCAGAGGAGAATATTGTG 
YIGEVVRDNTISRSSEENIV 

490 510 530 

GCCATTGGCATAGCAGCTTGGGGCATGGTCTCCAACCGGGACACCCTCATCAGGAATTGC 
AIGI AAWGMVSNRDTIj IRNC 

550 570 590 

GATGCTGAGGGCTATTTTTTAGCCCAGTACCTTATGGATGACTTCACAAGAGATCCACTG 
DAEGYFLAQYLMDDFTRDPIi 

610 630 650 

TATATCCTGGACAACAACCACACACATTTGCTGCTCGTGGACAATGGCTGTCATGGACAT 
YILDNNHTHLLLVDNGCHGH 

670 690 710 

CCCACTGTCGAAGCAAAGCTCCGGAATCAGCTAGAGAAGTATATCTCTGAGCGCACTATT 
PTVEAKIiRNQLEKYI S E R T I 

730 750 770 

CAAGATTCCAACTATGGTGGCAAGATCCCCATTGTGTGTTTTGCCCAAGGAGGTGGAAAA 
QDSNYGGKIPIVCFAQGGGK 

790 810 830 

GAGACTTTGAAAG C C ATC AATAC CTCCATCAAAAATAAAATTC CTTGTGTGGTGGTGGAA 
ETLKA INTS I K N K I P CVVVE 

850 870 890 

GGCTCGGGCCAGATCGCTGATGTGATCGCTAGCCTGGTGGAGGTGGAGGATGCCCTGACA 
GSGQ IADVIASLVEVEDALT 

910 930 950 

TCTTCTGCCGTCAAGGAGAAGCTGGTGCGCTTTTTACCCCGCACGGTGTCCCGGCTGCCT 
SSAVKEKLVRFIiPRTVSRLP 

970 990 1010 

GAGGAGGAGACTGAGAGTTGGATCAAATGGCTCAAAGAAATTCTCGAATGTTCTCACCTA 
EEETESWlKWIiKEILECSHL 

1030 1050 1070 

TTAACAGTTATTAAAATGGAAGAAGCTGGGGATGAAATTGTGAGCAATGCCATCTCCTAC 
LTVIKMEEAGDEIVSNAISY 

1090 1110 1130 

GCTCTATACAAAGCCTTCAGCACCAGTGAGCAAGACAAGGATAACTGGAATGGGCAGCTG 
ALYKAFSTSEQDKDNWNGQL 
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2410 2430 2450 

AGAAACTTAGGACCCAAGATTATAATGCTGCAGAGGATGCTGATCGATGTGTTCTTCTTC 
RNLGPKIIMLQRMLIDVFFF 

2470 2490 2510 

CTGTTCCTCTTTGCGGTGTGGATGGTGGCCTTTGGCGTGGCCAGGCAAGGGATCCTTAGG 
LFLFAVWMVAFGVARQGILR 

2530 2550 2570 

CAGAATGAGCAGCGCTGGAGGTGGATATTCCGTTCGGTCATCTACGAGCCCTACCTGGCC 
QNEQRWRWIFRSVIYEPYLA 

2590 2610 2630 

ATGTTCGGCCAGGTGCCCAGTGACGTGGATGGTACCACGTATGACTTTGCCCACTGCACC 
MFGQVPSDVDGTTYDFAHCT 

2650 2670 2690 

TTCACTGGGAATGAGTCCAAGCCACTGTGTGTGGAGCTGGATGAGCACAACCTGCCCCGG 
FTGNE SKPLCVELDEHNLPR 

2710 2730 2750 

TTCC C CGAGTGG ATC ACCATCCCCCTGGTGTGC ATCTAC ATGTTATCCACCAAC ATCCTG 
FPEW I TIPLVCIYM-LSTNIIi 

2770 2790 2810 

CTGGTCAACCTGCTGGTCGCCATGTTTGGCTACACGGTGGGCACCGTCCAGGAGAACAAT 
Ii V N L L VAMFGYTVG TVQENN 

2830 2850 2870 

GACCAGGTCTGGAAGTTCCAGAGGTACTTCCTGGTGCAGGAGTACTGCAGCCGCCTCAAT 
D QVW KFQRY FLVQ E YC S R L N 

2890 2910 2930 

ATCCCCTTCCCCTTCATCGTCTTCGCTTACTTCTACATGGTGGTGAAGAAGTGCTTCAAG 
I PFPFIVFAYFYMVVKKCFK 

2950 2970 2990 

TGTTGCTGGAAGGAGAAAAACATGGAGTCTTCTGTCTGCTGTTTCAAAAATGAAGACAAT 
CCCKEKNMESSVCC FKNEDN 

3010 3030 3050 

GAGACTCTGGCATGGGAGGGTGTCATGAAGGAAAACTACCTTGTCAAGATCAACACAAAA 
ETIiAWEGVMKENYLVKINTK 

3070 3090 3110 

GCCAACGACACCTCAGAGGAAATGAGGCATCGATTTAGACAACTGGATACAAAGCTTAAT 
ANDTSEEMRHRFRQIiDTKLN 

3130 3150 
GATCTCAAGGGTCTACTGAAAGAGATTGCTAATAAAATCAAATAG 
DLKGLLKEIANKI K* 



b.) TrplO protein: 

MKSFLPVHTIVLIRENVCKCGYAQSQHMEGTQINQSEKWNYKJC^ 

ELLTQHWHLKTPNLVI SVTGGAKNFALKPRMRKI FSRLI YIAQSKGAWILTGGTHYGLMKYIGEVVRDNTISRSSEENIV 

AIGIAAWGMVSNRDTLIRNCDAEGYFLAQYLMD^ 

QDSNYGGKIPIVCFAQGGGKETLKAINTSIKN^^ 

EEETES W I KWLKE ILECSHLXiTVI KMEEAGDE I VSNAI S YAL YKAFSTSEQDKI)NWNGQIjKIjLIjEWNQIjDIjANDE I FTND 

RRWESADLQEVMFTALIKDRPKFVRLFLENGLNDRKFLTHDVLTELFSNHFSTLVYRNL 

FRRGFRKEDRNGRDEMDIELHDVSPITRHPLQALF^ 

AGESEELANEYETRAVELFTECYSSDEDLT^QLLWSCEAWGGSNCLEI^VEATDQHFIAQPGVQNFLSKQWYGEISRDT 
KNWKI I LCLF 1 1 PLVGCGFVS FRKKPVDKHKKLLWYYVAFFTS PFWFS WNWFYIAFLIiLFAYVLLMDFHSVPHPPELV 
LYSLVFVTiFCDEWQWYWGVXreFTDLW^ 

RNLGPKI I MXjQRML IDVFFFL FLFAVWMVAFGVARQG I LRQNEQRWRWI FRS VI YE P YLAMFGQVPSDVDGTT YDFAHCT 

FTGNESKPLCVELDEHl^PRFPEWITIPLVCIYM 

IPFPFIVFAYFYMWKKCFKCCCKEKIWESSVCCFKN^ 

DLKGLLKEIANKIK 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-12, 29-31 partially, 13-28 completely 

Isolated nucleic acid molecules encoding human prostate 
carcinom associated proteins as characterized by 
SEQIDs5,45,ll,3 and SEQIDs 6,46,12,4, respectively; the 
recombinant expression of the same in host cells; the 
isolated proteins as characterized by SEQIDs 6,46,12,4; 
anti sense RNA sequence and ribozyme complementary to said 
nucleic acid molecules; inhibitor that can suppress the 
activity of said prostate carcinom associated proteins; 
method for diagnosing a prostate carcinoma by contacting a 
sample with a nucleic acid, an antibody or other reagent 
that reacts with the mRNA of SEQIDs5,45,ll,3; method for 
diagnosing endomertial cancer by contacting a target sample 
with a nucleic acid, an antibody or other reagent that 
reacts with the mRNA of SEQIDs5,45,ll,3; method for 
diagnosing a melanoma, chorion carcinoma, cancer of the lung 
and ot the prostate comprising contacting a target sample 
with a reagent which detects anti sense RNA of SEQIDs 11 and 
3; method for preventing prostate tumour, endometrial 
cancer, choroin carcinoma or cancer of the lung comprising 
administering an inhibiting reagent of human prostate 
carcinom associated proteins; diagnostic kit containing an 
antibody; method for identifyng an agonist or an antagonist 
of human prostate carcinom associated proteins. 



2. Claims: 1-12, 29-31 partially 

Isolated nucleic acid molecule encoding human prostate 
carcinom associated protein as characterized by SEQIDs 7 and 
SEQIDs 8, respectively; the recombinant expression of the 
same in host cells; the isolated protein as characterized by 
SEQIDs 8; anti sense RNA sequence and ribozyme complementary 
to said nucleic acid molecule; inhibitor that can suppress 
the activity of said prostate carcinom associated protein; 
diagnostic kit containing an antibody; method for identifyng 
an agonist or an antagonist of human prostate carcinom 
associated proteins. 
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Continuation of Box 1.2 
Claims Nos.: 12 partially 

Present claim 12 relates to an inhibitor wich is defined by reference to 
a desirable characteristic or property, namely suppressing the activity 
of the protein of claim 6. 



The claims cover all inhibitors having this characteristic or property, 
whereas the application provides only support within the meaning of 
Article 6 PCT and/or disclosure within the meaning of Article 5 PCT for a 
limited number of such inhibitors. 

In the present case, the claims so lack support, and the application so 
lacks disclosure, that a meaningful search over the whole of the claimed 
scope is impossible. Independent of the above reasoning, the claims also 
lack clarity (Article 6 PCT). An attempt is made to define the inhibitors 
by reference to a result to be achieved. 

Again, this lack of clarity in the present case is such as to render a 
meaningful search over the whole of the claimed scope impossible. 

Consequently, the search has been carried out for those parts of the 
claim 12 which appear to be clear, supported and disclosed, namely those 
parts relating to the Trp8/10 corresponding antibody , Trp8/10 
corresponding an ti sense construct, a Trp8/10 corresponding nbozyme. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. Tnis is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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